Microphone array, monitoring system, and sound pickup setting method

ABSTRACT

A camera and a microphone array configuring a monitoring system are capable of receiving electric power from a PoE apparatus through a LAN cable. In a case where a first switching operation is performed on a microphone array side, an output terminal of an input switch is connected to an input terminal of a PoE electric power reception circuit side. An input terminal of an output switch is connected to an output terminal of a PoE electric power transmission circuit side. On a camera side, an output terminal of the input switch is connected to an input terminal of a PoE electric power receptor side. The microphone array receives electric power that is supplied from the PoE apparatus for operation and transmits the electric power towards the camera. The camera receives the supplied electric power from the PoE apparatus through the microphone array and the LAN cable for operation.

CROSS-REFERENCE TO RELATED APPLICATIONS

This is a continuation application of U.S. patent application Ser. No.15/528,552, filed May 22, 2017, which is a National Stage Application ofPCT/JP2015/006127, filed Dec. 9, 2015. The disclosures of thesedocuments, including the specifications, drawings and claims areincorporated herein by reference in their entirety.

TECHNICAL FIELD

The present disclosure relates to a microphone array, a sound pickupsystem and a sound pickup setting method, in which directionality forvoice that is sound-picked up is formed in a predetermined position andis output.

BACKGROUND ART

In a monitoring system that is installed in a prescribed position (forexample, a ceiling or a wall) within a factory, a store (for example, aretail store or a bank), a shopping center, and a public place (forexample, a station or a library), a plurality of camera apparatuses areconnected through a network, and monitoring of picture data (which,hereinafter, includes a still image and a moving image) in a prescribedrange for a monitoring target is performed in a monitoring apparatusthat is installed in one place.

However, in monitoring of only a picture, because there is necessarily alimitation on an amount of information that is acquired, there is anincreasing demand for a monitoring system that can obtain voice data inorder to perform monitoring that uses voice.

In response to this demand, there is a product in which a microphone ismounted in one or several camera apparatuses and audio data istransmitted to a network in a state of being superimposed on picturedata. However, in most cases, the microphone that is used in thisproduct is non-directional, and for example, although the microphone isunidirectional, this directional characteristic is a wide angle.Therefore, in most cases, sound that is desired to be heard in themonitoring is drowned out by noise and thus is not heard.

In contrast, in a monitoring camera system, for example, as is disclosedin PTL 1, a microphone array that satisfies a requirement for detectinga direction in which abnormal sound occurs or for desirably hearing onlysound in a specific direction while being drowned out by noise is underdevelopment. The microphone array in PTL 1 is used in order that aplurality of microphone units are arranged and in order that voice froma place designated by a user is reproduced on a screen on which apicture that is image-captured in a monitoring camera (for example, anomnidirectional camera) is displayed, in a highlighting manner, usingthe voice that is sound-picked up in each microphone unit.

However, in an existing monitoring system that is installed for the mainpurpose of monitoring of a picture by the monitoring camera (hereinaftersimply referred to as a camera), in a case where a microphone array formonitoring voice is additionally installed later, in most cases, anattachment place is a high-height place, such as a ceiling. Thus, workfor the additional installation is difficult.

FIGS. 13A to 13C are diagrams for describing later additional work in acase where microphone array 105 is attached to camera 103 that isalready installed, on which existing LAN cables 191 and 191A are laid.FIG. 13A illustrates an installation example in which camera 103 isalready installed. Camera 103 is installed on ceiling 118. Camera LANconnector 131 is provided on a rear-side end portion of a case of camera103. LAN cable 191 of which an end portion (a plug) is inserted intocamera LAN connector 131 is laid through hole 118 a that is formed inceiling 118 facing a rear side of the case of camera 103, and isconnected to PoE apparatus 106 that is linked to network 109. Camera 103is connected to network 109 through LAN cable 191 in such a manner thatcommunication is possible, and operates with supply of electric powerfrom PoE apparatus 106 that is present between camera 103 and network109.

FIG. 13B illustrates an installation example in which microphone array105 is additionally installed later to camera 103 that is alreadyinstalled. Microphone LAN connector 151 is provided within an openingthat is formed in a case of ring-type microphone array 105 which isinstalled in such a manner to externally fit camera 103, and LAN cable191B for a microphone, of which an end portion (a plug) is inserted intomicrophone LAN connector 151, is connected to PoE apparatus 106 in sucha manner to run along LAN cable 191A for a camera. That is, two cables,long LAN cable 191B for a microphone and LAN cable 191A for a camera,which have almost the same length, are laid within a space between aroof and a ceiling.

FIG. 13C illustrates other one installation example in which microphonearray 105 is additionally installed later to camera 103 that is alreadyinstalled. PoE hub 108 that causes LAN cable 191 which is linked to PoEapparatus 106 to ramify is installed within the space between the roofand the ceiling. Short LAN cable 192 for a camera is connected betweenone connector of PoE hub 108 and camera LAN connector 131, and short LANcable 193 for a microphone is connected between the other connector ofPoE hub 108 and microphone LAN connector 151.

FIGS. 14A and 14B are diagrams for describing later additional work in acase where microphone array 105 is attached to camera 103 that isalready installed, on which existing LAN cables 191 and DC cable 194 arelaid.

FIG. 14A illustrates an installation example in which camera 103 isalready installed. Camera 103 is installed on ceiling 118. Camera LANconnector 131 and camera DC connector 132 are provided on a rear-sideend portion of the case of camera 103. In the same manner as in FIG.13A, LAN cable 191 of which an end portion (a plug) is inserted intocamera LAN connector 131 is laid through hole 118 a that is formed inceiling 118 facing a rear side of the case of camera 103, and isconnected to connector 109. In this case, LAN cable 191 is laid in amanner that is dedicated for communication, not for the PoE apparatus.Furthermore, DC cable 194 of which an end portion (a plug) is insertedinto camera DC connector 132 is linked to electric power sourceapparatus 116, such as an AC adaptor, which is installed in the spacebetween the roof and the ceiling. Camera 103 is connected to network 109through LAN cable 191 in such a manner that the communication ispossible, and operates with supply of electric power from electric powersource apparatus 116.

FIG. 14B illustrates an installation example in which microphone array105 is additionally installed later to camera 103 that is alreadyinstalled. Microphone LAN connector 151 and microphone DC connector 152are provided within the opening that is formed in the case of ring-typemicrophone array 105. In the same manner as in FIG. 13B, LAN cable 191Bfor a microphone, which is connected to microphone LAN connector 151, islaid in such a manner as to run along LAN cable 191A for a camera. Onthe other hand, DC cable 195 for a microphone, of which an end portion(a plug) is inserted into microphone DC connector 152 is connected toelectric power source apparatus 116 in such a manner as to run along DCcable 194 for a camera. Therefore, a total of four cables, long DC cable195 for a microphone and DC cable 194 for a camera, which have almostthe same length, in addition to long LAN cable 191B for a microphone andLAN cable 191A for a camera, which has almost the same length, areprovided with the space between the roof and the ceiling.

In this manner, in a case where later additional work for adding amicrophone array that monitors voice to the camera that is alreadyinstalled is performed, an amount of work for installation increases.Therefore, it is desirable that the amount of work for installation isreduced. Furthermore, it is also desirable that a small amount of workfor installation is performed in initial work for newly installing amonitoring system.

Additionally, in a case where an omnidirectional camera and a microphonearray are attached on the same axis, because coordinates of a pictureand voice are consistent with each other (more precisely, an imagecapture direction from the omnidirectional camera and a direction inwhich voice that is sound-picked up by the microphone array isemphasized are the same), a problem does not particularly occur, but ina case where the monitoring camera and the microphone array are attachedseparately from each other, a positional relationship between themonitoring camera and the microphone array is unclear. Therefore, forexample, when initial setting is performed, if a correspondencerelationship between a coordinate system of the monitoring camera and acoordinate system of the microphone array is not obtained, it isdifficult to form directionality of voice in a direction toward apredetermined position in which the monitoring camera captures an imagewhile the monitoring is in progress.

Particularly, for the reason of strength or a structure of a building ora structured body, when there is a limitation on an attachment positionof the monitoring camera or the microphone array, information on anactual attachment position is not obtained precisely. Therefore, inorder to obtain the correspondence relationship between the coordinatesystem of the camera and the coordinate system of the microphone array,if the attachment position or direction or the like has to be measuredon the spot, work that requires a lot of labor is necessary in additionto a dedicated measuring instrument.

An object of the present disclosure is to reduce an amount of work forinstallation when a monitoring system is set up in a state of beingcombined with a camera. Another object of the present disclosure is toreduce an amount of work for installation when a monitoring system isset up in a state of being combined with a camera, by suitably formingdirectionality in a predetermined image capture position and clearlyoutputting voice in a prescribed image capture position, even in a casewhere a positional relationship between a camera and a microphone arrayis unclear.

CITATION LIST Patent Literature

PTL 1: Japanese Patent Unexamined Publication No. 2014-143678

SUMMARY OF THE INVENTION

According to an aspect of the present disclosure, there is provided amonitoring system including a microphone array that has a plurality ofsound pickup elements and sound-picks up voice using the sound pickupelements; at least one camera that image-captures a predeterminedposition; a sound source detection unit that detects a sound sourcedirection from the microphone array, based on voice data on the voicethat is sound-picked up; a display unit on which the detected soundsource direction from the microphone array is displayed according toprescribed voice output in the predetermined position; an operation unitthat receives an input of information relating to the camera whichimage-captures the predetermined position, according to designation ofthe displayed sound source direction from the microphone array; and astorage unit in which correspondence information that results fromassociating information relating to the camera, which is input, and thesound source direction from the microphone array is stored.

According to another aspect of the present disclosure, there is provideda microphone array that is connected to a camera and a network, themicrophone array including: a sound pickup unit that has a plurality ofsound pickup elements, and sound-picks up voice; a first networkinterface unit has a hub function of transmitting voice data on thevoice that is sound-picked up by the sound pickup unit, and image datathat is image-captured by the camera and is transmitted to themicrophone array, to the network through a first signal line; a firstelectric power source unit that receives first electric power which istransmitted through the first signal line that is linked to the network,or second electric power which is transmitted from an outer electricpower source; an input switch unit that switches between the firstelectric power or the second electric power and outputs the resultingelectric power; and an output switch unit that supplies an output of theinput switch unit to the camera through a second signal line which linksbetween the microphone array and the camera, or through an electricpower supply line, in which the camera includes an image capturing unitthat image-captures an image, a second network interface unit thattransmits an image data on an image that is image-captured by the imagecapturing unit, to the microphone array through a second signal line,and a second electric power source unit that receives the first electricpower or the second electric power through the second signal line or anelectric power supply line.

According to still another aspect of the present disclosure, there isprovided a microphone array that is included in a monitoring systemaccording to the present disclosure and that is connected to a cameraand a network, the microphone array including: a sound pickup unit thathas a plurality of sound pickup elements, and sound-picks up voice; anetwork interface unit has a hub function of transmitting voice data onthe voice that is sound-picked up by the sound pickup unit, and imagedata that is image-captured by the camera and is transmitted to themicrophone array, to the network through a first signal line; anelectric power source unit that receives first electric power which istransmitted through the first signal line that is linked to the network,or second electric power which is transmitted from an outer electricpower source; an input switch unit that switches between the firstelectric power or the second electric power and outputs the resultingelectric power; and an output switch unit that supplies an output of theinput switch unit to the camera through a second signal line which linksbetween the microphone array and the camera, or through an electricpower supply line.

Furthermore, according to still another aspect of the presentdisclosure, there is provided a sound pickup setting method for use in amonitoring system that includes at least one camera which image-capturesa predetermined position and a microphone array, the method including: astep of causing the microphone array, which have a plurality of soundpickup elements, to sound-pick up prescribed output voice from a soundsource that is placed in the predetermined position; a step of detectinga sound source direction from the microphone array based on voice dataon voice that is sound-picked up by the microphone array; a step ofdisplaying the detected sound source direction from the microphone arrayon a display unit; a step of designating the sound source direction fromthe microphone array, which is displayed on the display unit; a step ofinputting information relating to the camera that image-captures thepredetermined position, according to the designation of the sound sourcedirection; and a step of storing correspondence information that resultsfrom associating the information relating to the camera, which is input,and the sound source direction from the microphone array, in a storageunit.

According to the present disclosure, an amount of work for installationcan be reduced when a monitoring system is set up in a state of beingcombined with a camera. Additionally, according to the presentdisclosure, an amount of work for installation can be reduced when amonitoring system is set up in a state of being combined with a camera,by suitably forming directionality in a predetermined image captureposition and clearly outputting voice in a prescribed image captureposition, even in a case where a positional relationship between acamera and a microphone array is unclear.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 is a diagram illustrating an external appearance of a monitoringsystem according to a first embodiment.

FIG. 2A is a cross-sectional diagram illustrating a structure in a statewhere the monitoring system that is configured with a camera and amicrophone array is attached to a ceiling attachment metal fitting.

FIG. 2B is an exploded perspective diagram of the monitoring system.

FIG. 3 is a perspective diagram illustrating a rear side of a case ofthe monitoring system in which the camera and the microphone array arecombined.

FIG. 4 is a block diagram illustrating a hardware configuration of themonitoring system.

FIG. 5 is a diagram for describing a first switching operation.

FIG. 6 is a diagram for describing a second switching operation.

FIG. 7 is a diagram for describing a third switching operation.

FIG. 8 is a diagram for describing a fourth switching operation.

FIG. 9A is a diagram illustrating an installed state of a camera and amicrophone array according to a second embodiment, which are installedon a ceiling.

FIG. 9B is a perspective diagram illustrating an external appearance ofthe microphone array.

FIG. 10A is a cross-sectional diagram illustrating an attachmentstructure of a ceiling-embedded monitoring system according to a thirdembodiment.

FIG. 10B is a cross-sectional diagram illustrating a structure of asupport member.

FIG. 11 is a perspective diagram illustrating a rear side of amonitoring system that is embedded into the ceiling.

FIG. 12A is a diagram illustrating an installed state of a monitoringsystem according to a fourth embodiment, which is installed on theceiling.

FIG. 12B is a diagram illustrating an external appearance of amicrophone array according to the fourth embodiment.

FIG. 13A is a diagram for describing later additional work in a casewhere a microphone array is attached to the camera that is alreadyinstalled, on which the existing LAN cable is laid.

FIG. 13B is a diagram for describing the later additional work in thecase where a microphone array is attached to the camera that is alreadyinstalled, on which the existing LAN cable is laid.

FIG. 13C is a diagram for describing the later additional work in thecase where a microphone array is attached to the camera that is alreadyinstalled, on which the existing LAN cable is laid.

FIG. 14A is a diagram for describing later additional work in a casewhere a microphone array is attached to the camera that is alreadyinstalled, on which the existing LAN cable and DC cable are laid.

FIG. 14B is a diagram for describing the later additional work in thecase where a microphone array is attached to the camera that is alreadyinstalled, on which the existing LAN cable and DC cable are laid.

FIG. 15 is a block diagram illustrating a configuration of a monitoringsystem according to a fifth embodiment.

FIG. 16 is a block diagram illustrating a configuration of a voiceprocessing apparatus.

FIG. 17 is a block diagram illustrating a configuration of a microphonearray apparatus.

FIG. 18 is a diagram illustrating a structure of a packet of voice datathat is transmitted from the microphone array apparatus to the voiceprocessing apparatus.

FIG. 19 is a diagram illustrating a layout of the inside of a storewhere the monitoring system is installed.

FIG. 20 is a diagram for describing an outline of preset processing.

FIG. 21 is a flowchart illustrating a sound pickup procedure at the timeof the preset processing and monitoring.

FIG. 22 is a diagram illustrating a screen of a display on which a voicemap is displayed at the time of the preset processing.

FIG. 23 is a diagram illustrating a detail of listing in a presetinformation table that is stored in the microphone array apparatus.

FIG. 24 is a diagram illustrating the screen of the display that isdisplayed after the preset processing.

FIG. 25 is a diagram illustrating the screen of the display that isdisplayed at the time of the monitoring and a sound generation operationby a speaker.

FIG. 26 is a block diagram illustrating a configuration of a monitoringsystem according to a sixth embodiment.

FIG. 27 is a flowchart illustrating a sound pickup procedure at the timeof the preset processing and the monitoring.

FIG. 28 is a flowchart illustrating the preset processing and the soundpickup procedure at the time of the monitoring, which follows theflowchart in FIG. 27.

FIG. 29 is a diagram illustrating the screen of the display that isdisplayed at the time of the preset processing.

FIG. 30 is a diagram illustrating the screen of the display that isdisplayed at the time of the monitoring and the sound generationoperation by the speaker.

FIG. 31 is a block diagram illustrating a configuration of a monitoringsystem according to a seventh embodiment.

FIG. 32 is a diagram illustrating a detail of listing of a presetinformation table that is stored in a table memory.

FIG. 33 is a flowchart illustrating a preset processing procedure.

FIG. 34 is a diagram illustrating the screen of the display that isdisplayed at the time of the preset processing.

FIG. 35 is a flowchart illustrating the sound pickup procedure at thetime of the monitoring.

FIG. 36 is a diagram illustrating the screen of the display that isdisplayed at the time of the monitoring and the sound generationoperation by the speaker.

FIG. 37 is a diagram illustrating a screen of the display that isdisplayed at the time of the monitoring in a first modification exampleof the seventh embodiment.

FIG. 38 is a table illustrating a detail of listing in a presetinformation table in a third modification example of the seventhembodiment.

DESCRIPTION OF EMBODIMENTS

Embodiments of each of a microphone array, a monitoring system, and asound pickup setting method according to the present disclosure will bedescribed below with reference to the drawings. A monitoring camera thatis used for the monitoring system is here referred to as a “camera” forshort. Furthermore, the monitoring system is also referred to as amicrophone-integrated camera that results from combining a camera and amicrophone.

First Embodiment

FIG. 1 is a diagram illustrating an external appearance of monitoringsystem 10 according to a first embodiment. Monitoring system 10 has aconfiguration in which network-supporting type camera 3 and microphonearray 5 are combined.

Camera 3 is an omnidirectional camera (that is, a camera that has a viewangle in an image capture range of 360 degrees) that has a disc-shapedcase 3 z and is equipped with a fish-eye lens on the front surface ofcase 3 z.

Microphone array 5 has a ring-type case 5 z in which a plurality ofmicrophone units 65 (refer to FIG. 4) are concentrically arranged, andpicks up voice in all directions (that is, at an angle of 360 degrees).

Monitoring system 10 is installed on ceiling 18 (refer to FIG. 2) in astate where case 3 z of camera 3 is accommodated into opening 5 r thatis formed in case 5 z of microphone array 5.

FIG. 2A is a cross-sectional diagram illustrating a structure in a statewhere monitoring system 10 that is configured with camera 3 andmicrophone array 5 is attached to ceiling attachment metal fitting 7.

FIG. 2B is an exploded perspective diagram of a monitoring system 10.

Ceiling attachment metal fitting 7 is a member that is made of metal,which is formed in the shape of approximately a disc that has an unevensurface. A material of ceiling attachment metal fitting 7 may be resinor the like.

Hole 7 k is provided in ceiling attachment metal fitting 7. LAN cable 91passes through hole 7 k past the vicinity of the center thereof.Engagement holes 73 that are engaged with fixation pins 54,respectively, which are provided on the bottom surface of microphonearray 5, are concentrically formed in the vicinity of a periphery of asurface of ceiling attachment metal fitting 7. Engagement hole 73 isformed in the shape of approximately a gourd in such a manner that adiameter of one end of engagement hole 73 is greater than that of theother end (refer to FIG. 2B).

Furthermore, pin hole 7 v, into which ceiling fixation pin 44 isinserted, is formed at four points in the center portion of a surface ofceiling attachment metal fitting 7. Ceiling fixation pin 44 passesthrough pin hole 7 v and is screwed into a surface of ceiling 18, andthus ceiling attachment metal fitting 7 is fixed to ceiling 18.

Locking piece 5 t for fixing camera 3 is formed at concentric threepoints in a peripheral portion of the case of microphone array 5.Engagement hole 71 that is engaged with fixation pin 33 which isprovided on the bottom surface of camera 3 is formed in locking piece 5t in the shape of approximately a gourd in such a manner that a diameterof one end of engagement hole 71 is greater than that of the other end(refer to FIG. 2B).

Each of fixation pins 33 and 54 is made up of a head portion that hasthe thickness midway between parameters of one end portion and the otherportion of each of engagement holes 71 and 73, and a body that isslenderer than the head portion.

Furthermore, a side surface of case 3 z of camera 3 is formed on atapered surface that is slightly narrower outward (toward the oppositeside of the ceiling), and an inner wall surface of opening 5 r that isformed in case 5 z of microphone array 5 is formed in the shape of acylinder that is not tapered, or formed on a reversely tapered surfacewith respect to a side surface of case 3 z of camera 3. Therefore, whencase 3 z of camera 3 is attached to opening 5 r of case 5 z ofmicrophone array 5, there is a gap that provides an opening. Because thegap has an adverse influence on microphone array 5, such as reflectionor resonance, spacer 4 is attached to surround camera 3 in such a manneras to fill the gap.

In spacer 4, a plurality of peripheral wall elastic pawls 177 are atequal distances in a circumferential direction within a plurality ofcuts 179, respectively. Peripheral wall elastic pawl 177 is engaged withcase 3 z of camera 3, and thus spacer 4 is attached.

Furthermore, in a case where at the time of work for initialinstallation, camera 3 and microphone array 5 are attached to ceilingattachment metal fitting 7, an attachment procedure is as follows.First, ceiling attachment metal fitting 7 is attached to the ceiling.Next, fixation pin 54 that protrudes from the bottom surface ofmicrophone array 5 is inserted into one end portion that has a greaterdiameter, of engagement hole 73 that is provided in ceiling attachmentmetal fitting 7. In a state where a head portion of fixation pin 54protrudes from engagement hole 73, microphone array 5 is twisted. In astate where microphone array 5 is twisted and thus the head portion offixation pin 54 is moved to the other end portion of engagement hole 73,fixation pin 54 and engagement hole 73 are engaged with each other andmicrophone array 5 is fixed in the direction of central axis e.

Next, LAN cable 91 is pulled out of hole 7 k and is inserted intomicrophone LAN connector 51. Additionally, LAN cable 92 is inserted intomicrophone LAN connector 52.

Later, microphone array 5 is attached to ceiling attachment metalfitting 7, and then camera 3 is attached to locking piece 5 t ofmicrophone array 5 in such a manner that camera 3 is accommodated intoopening 5 r in case 5 z of microphone array 5. At this time, fixationpin 33 is engaged with engagement hole 71 that is formed in lockingpiece 5 t. A procedure for engaging fixation pin 33 with engagement hole71 is the same as a procedure for engaging fixation pin 54 withengagement hole 73.

Furthermore, in a case where camera 3 is already attached to ceilingattachment metal fitting 7A (refer to FIG. 9A), ceiling attachment metalfitting 7 is replaced with ceiling attachment metal fitting 7B formicrophone array 5.

FIG. 3 is a perspective diagram illustrating a rear side of a case ofmonitoring system 10 in which camera 3 and microphone array 5 arecombined. There is a gap (refer to FIG. 2B) between a rear surface ofcase 3 z of camera 3 connect and ceiling 18, and when case 5 z ofmicrophone array 5 is combined in such a manner as to externally fitcase 3 z of camera 3, space 16 is generated that is surrounded by aninner side of opening 5 r in case 5 z of microphone array 5, a rearsurface of case 3 z of camera 3 and ceiling 18.

In order to be exposed toward space 16, microphone LAN connector 51 forinput, and microphone LAN connector 52 and microphone DC connector 53for output are provided in the inner wall surface of opening 5 r that isformed in case 5 z of microphone array 5. Microphone DC connector 53corresponds to both of microphone DC connector 53A (refer to FIG. 4) forinput and microphone DC connector 53B for output. Furthermore, cameraLAN connector 31 is provided on an end portion that is formed on therear surface of case 3 z of camera 3.

One end portion (a plug) of short LAN cable 92 is inserted intomicrophone LAN connector 52, and the other end portion (a plug) of shortLAN cable 92 is inserted into camera LAN connector 31. LAN cable 92 isaccommodated into space 16. Furthermore, one end portion (a plug) oflong LAN cable 91 that is laid on a rear side of ceiling 18 is insertedinto microphone LAN connector 51. The other end portion of LAN cable 91is connected to PoE apparatus 6 (refer to FIG. 4) that is installed onthe rear side of ceiling 18.

FIG. 4 is a block diagram illustrating a hardware configuration ofmonitoring system 10. Monitoring system 10, as described above, isconfigured with camera 3 and microphone array 5, and is connected tonetwork 9 through PoE apparatus 6. Furthermore, camera 3 and microphonearray 5 is capable of receiving power from PoE apparatus 6 through LANcable 91, or from outer DC power supply 85 through DC cable 94.

PoE apparatus 6 is an apparatus that is interposed between network 9 andmicrophone array 5 and is capable of supplying power to microphone array5 through LAN cable 91 that is a communication cable. An amount ofelectric power that is capable of being supplied in compliance with aPoE specification is 15.4 W (on the receiving side) in a voltage rangebetween 44 V and 57 V. An amount of electric power that is capable ofbeing supplied in compliance with PoE+that is a higher-levelspecification than PoE is 30.0 W (on the receiving side) in a voltagerange between 50 V and 57 V. In the present embodiment, electric poweris capable of being supplied in compliance with any one of the PoE andPoE+specifications. Furthermore, in a case where electric power issupplied in compliance with PoE and PoE+, there are type A that uses apin in a shared manner both for communication and electric power, andtype B that uses a pin for electric power, which is normally not usedfor communication. In the present embodiment, any one of the types iscapable of being supported. This is also the same for the followingembodiments.

Network 9 is a communication network, such as a wired LAN or theInternet. Over network 9, image data that is image-captured innetwork-supporting type camera 3, or voice data that is sound-picked upin network-supporting type microphone array 5 is transferred to an outerapparatus that is connected to network 9.

Microphone array 5, as described above, has microphone set 64 that is anomnidirectional microphone array and includes a plurality of microphoneunits 65, host CPU 55, input switch unit 57, output switch unit 58,network interface unit 56, and step-up/down power supply circuit 59.

For example, a high sound-quality Electret Condenser Microphone (ECM) isused for a plurality of microphone units 65. Each of the plurality ofmicrophone units 65 that are included in microphone set 64 does not havedirectionality and outputs the voice data that is sound-picked up tohost CPU 55.

Host CPU 55 controls operation of each unit of microphone array 5. Forexample, host CPU 55 transmits the voice data that is sound-picked up bythe plurality of microphone units 65 to network 9 through networkinterface unit 56, drives input switch unit 34, switches a supply sourceof electric power that is supplied to microphone array 5, drives outputswitch unit 58, and switches a supply source of electric power to camera3.

Network interface unit 56 has a hub function of transmitting the voicedata on voice that is sound-picked up by microphone array 5, and theimage data on an image that is image-captured by camera 3 and istransmitted to microphone array 5 to network 9 through LAN cable 91.Network interface unit 56 switches communication that is performed withnetwork 9 through LAN cable 91, to communication with the camera 3 sidethrough microphone LAN connector 52, for transfer, andramification-controls communication that is performed between host CPU55 and network 9, by performing switching. The control of thecommunication may be by a repeater function, not through the switching.

Step-up/down power supply circuit 59 has PoE electric power receptioncircuit 61, system electric power supply circuit 62, and PoE electricpower transmission circuit 63.

When receiving electric power that is transmitted from PoE apparatus 6,through LAN cable 91 that is connected to microphone LAN connector 51,PoE electric power reception circuit 61 steps down the electric power toa voltage (for example, 12 V) that is approximately equal to that ofouter DC power supply 85, and outputs the resulting electric power toinput terminal 57 y of input switch unit 57.

System electric power supply circuit 62 steps down or steps up theelectric power that is input from output terminal 57 z of input switchunit 57, and generates a power supply voltage that is supplied to eachunit of microphone array 5.

PoE electric power transmission circuit 63 steps up the electric powerthat is input from output terminal 58 x of output switch unit 58, andtransmits the resulting electric power to the camera 3 side through LANcable 92 that is connected to microphone LAN connector 52.

Input switch unit 57 has a switch function of changing a supply sourceof the electric power that is supplied to each unit of microphone array5 to outer DC power supply 85 or PoE apparatus 6. According to aninstruction from host CPU 35, input switch unit 57 switches outputterminal 57 z to system electric power supply circuit 62, to any one ofinput terminal 57 x on the microphone DC connector 53A side, which islinked to outer DC power supply 85 and input terminal 57 y that islinked to PoE electric power reception circuit 61 within step-up/downpower supply circuit 59. Furthermore, input switch unit 57 is alsocapable of changing the switch function in a semi-fixed manner withmanual operation by an operator or the like or of configuring anelectric circuit with passive components such as a diode and changingelectric power automatically, without being caused by host CPU 55 to doso.

Output switch unit 58 switches a supply source of electric power that issupplied to camera 3, to outer DC power supply 85 or PoE electric powerreception circuit 61. Output switch unit 58 has a switch function ofchanging input terminal 58 z that is linked to output terminal 57 z ofinput switch unit 57 to any one of output terminal 58 y on themicrophone DC connector 53B side, which is linked to camera 3 through DCcable 93, and output terminal 58 x that is linked to PoE electric powertransmission circuit 63 within step-up/down power supply circuit 59,according to an instruction from host CPU 55. Furthermore, output switchunit 58 is also capable of changing the switch function in a semi-fixedmanner with the manual operation by the operator or the like, withoutbeing caused by host CPU 55 to do so.

On the other hand, camera 3 has host CPU 35, input switch unit 34,network interface unit 36, step-up/down power supply circuit 37, andimage capturing unit 40.

Image capturing unit 40 is equipped with a fish-eye lens that isinstalled on the front surface of case 3 z of camera 3, has imagingelement that is capable of capturing an image as anomnidirectional-fashion image, and outputs image data that isimage-captured, to host CPU 35. For example, a Charged Couple Device(CCD) or a Complementary Metal Oxide Semiconductor (CMOS) is used as animaging element.

Host CPU 35 controls operation of each unit of camera 3. Host CPU 35transmits the image data that is image-captured by image capturing unit40 to network 9 through network interface unit 36, and drives inputswitch unit 34 and switches a supply source of electric power that issupplied to camera 3.

Input switch unit 34 has a switch function of changing a supply sourceof electric power to the camera DC connector 32 side or the PoE electricpower reception circuit 38 side according to an instruction from hostCPU 35. That is, input switch unit 34 switches output terminal 34 z thatis linked to system power supply circuit 39 to any one of input terminal34 x that is linked to the camera DC connector 32 side and inputterminal 34 y that is linked to PoE electric power reception circuit 38.Furthermore, input switch unit 34 is also capable of changing the switchfunction in a semi-fixed manner with the manual operation by theoperator or the like or of configuring an electric circuit with passivecomponents such as a diode and changing electric power automatically,without being caused by host CPU 35 to do so.

Network interface unit 36 transmits the image data that isimage-captured by camera 3, to microphone array 5 through LAN cable 92,and controls communication that is performed between microphone array 5and host CPU 35 that are connected to each other through camera LANconnector 31 and LAN cable 92.

Step-up/down power supply circuit 37 has PoE electric power receptioncircuit 38 and system power supply circuit 39.

PoE electric power reception circuit 38 receives electric power that istransmitted from PoE apparatus 6 that is connected through camera LANconnector 31 and LAN cable 92, and steps down the electric power to avoltage (for example, 12 V) that is approximately equal to that of outerDC power supply 85 and outputs the resulting electric power to inputterminal 34 y of input switch unit 34.

System power supply circuit 39 steps down or up the electric power thatis input from output terminal 34 z of input switch unit 34, andgenerates a power supply voltage that is supplied to each unit of camera3.

The switching operation by monitoring system 10 that has theconfiguration described above is described. FIG. 5 is a diagram fordescribing a first switching operation. On the microphone array 5 side,host CPU 55 performs switching in such a manner that output terminal 57z of input switch unit 57 is connected to input terminal 57 y of the PoEelectric power reception circuit 61 side. Furthermore, host CPU 55switches input terminal 58 z of output switch unit 58 to output terminal58 x on the PoE electric power transmission circuit 63 side. On theother hand, on the camera 3, host CPU 35 switches output terminal 34 zof input switch unit 34 to input terminal 34 y on the PoE electric powerreception circuit 38 side. A flow of electric power that is supplied isillustrated by a broken line a in the drawing.

In a case where such a first switching operation is performed, inmicrophone array 5, electric power is supplied from PoE apparatus 6 toPoE electric power reception circuit 61 through LAN cable 91, microphoneLAN connector 51, and inner wiring 91 z. The electric power that isstepped up or down in PoE electric power reception circuit 61 issupplied to system electric power supply circuit 62 through inner wiring61 z, input switch unit 57, output switch unit 58, and inner wiring 62z. System electric power supply circuit 62 generates and supplieselectric power that is used in each unit of microphone array 5.

Furthermore, the electric power that is stepped down in PoE electricpower reception circuit 61 is supplied to PoE electric powertransmission circuit 63 through inner wiring 61 z, input switch unit 57,output switch unit 58, and inner wiring 63 z. The electric power that isstepped up in PoE electric power transmission circuit 63 is supplied toPoE electric power reception circuit 38 through inner wiring 92 z,microphone LAN connector 52, LAN cable 92, camera LAN connector 31, andinner wiring 31 z. The electric that is stepped up in PoE electric powerreception circuit 38 is supplied to system power supply circuit 39through inner wiring 38 z, input switch unit 34, and inner wiring 39 z.System power supply circuit 39 generates and supplies electric powerthat is used in each unit of camera 3.

In this manner, when the first switching operation is performed, any oneof microphone array 5 and camera 3 receives electric power from PoEapparatus 6 for operation. More precisely, microphone array 5 receiveselectric power that is supplied from PoE apparatus 6, for operation, andtransmits the electric power towards camera 3. Camera 3 receives theelectric power that is supplied from PoE apparatus 6 through microphonearray 5 and LAN cable 92, and operates, for operation.

Accordingly, a connection between microphone array 5 and camera 3 can beachieved using one short LAN cable 92 that links microphone LANconnector 52 and camera LAN connector 31. The electric power supply andthe communication are performed using the one LAN cable 92.

At this point, the case where host CPU 55 drives input switch unit 57and output switch unit 58 and host CPU 35 drives input switch unit 34 isdescribed, but this switching is once performed at the time of work forlater additional installation of microphone array 5 or work for initialinstallation of monitoring system 10. Therefore, the switching may beperformed by the manual operation by the operator and the like, withoutbe caused by the host CPU to do so. Because this is the same for thefollowing switching operations, the description of this is omitted.

FIG. 6 is a diagram for describing a second switching operation. A flowof electric power that is supplied is illustrated in a broken line b inthe drawing. Reception of electric power by microphone array 5 from PoEapparatus 6 is the same in the first switching operation. In the secondswitching operation, host CPU 55 within microphone array 5 switchesinput terminal 58 z of output switch unit 58 to output terminal 58 y onthe microphone DC connector 53B side. Furthermore, host CPU 35 withincamera 3 switches output terminal 34 z of input switch unit 34 to inputterminal 34 x on the camera DC connector 32 side.

In a case where such a second switching operation is performed, supplyof electric power to system electric power supply circuit 62 ofmicrophone array 5 is the same as in the first switching operation.Supply of electric power to camera 3 is performed as follows. Electricpower that is stepped down in PoE electric power reception circuit 61 inmicrophone array 5 is sent to inner wiring 61 z, input switch unit 57,output switch unit 58, microphone DC connector 53B, DC cable 93, andcamera DC connector 32. Additionally, electric power that is input fromcamera DC connector 32 is supplied to system power supply circuit 39through input switch unit 34 and inner wiring 39 z.

In this manner, when the second switching operation is performed,reception of electric power by any one of microphone array 5 and camera3 from PoE apparatus 6 for operation is the same as is the same casewith the first operation. However, in the second switching operation,supply of electric power is performed using DC cable 93 that linksmicrophone DC connector 53B and camera DC connector 32, without usingLAN cable 92.

Accordingly, between microphone array 5 and camera 3, the communicationis performed using LAN cable 92, and the supply of electric power isperformed using DC cable 93. More precisely, in the connection betweenmicrophone array 5 and camera 3, because the communication and thesupply of electric power are performed using different cables, twocables, that is, short LAN cable 92 and short DC cable 93 are needed.Furthermore, between microphone array 5 and camera 3, because DCelectric power that is generated within microphone array 5, as is, isreceived in camera 3, conversion loss that results from PoE electricpower transmission circuit 63 performing the step-up and transmittingelectric power using LAN cable 92 does not occur.

FIG. 7 is a diagram for describing a third switching operation. Thethird switching operation is for a case where PoE apparatus 6 is notpresent. More precisely, a connection is directly made betweenmicrophone LAN connector 51 and network 9 using LAN cable 91.

On the microphone array 5 side, host CPU 55 performs the switching insuch a manner that output terminal 57 z of input switch unit 57 isconnected to input terminal 57 x on the microphone DC connector 53Aside. Furthermore, host CPU 55 switches input terminal 58 z of outputswitch unit 58 to output terminal 58 x on the PoE electric powertransmission circuit 63 side. On the other hand, on the camera 3 side,host CPU 35 switches output terminal 34 z of input switch unit 34 toinput terminal 34 y on the PoE electric power reception circuit 38 side.A flow of electric power that is supplied is illustrated by a brokenline c in the drawing.

In a case where such a third switching operation is performed, inmicrophone array 5, electric power is supplied from outer DC powersupply 85 to system electric power supply circuit 62 through microphoneDC connector 53A, input switch unit 57, and inner wiring 62 z.Furthermore, the electric power that is input from microphone DCconnector 53A is supplied to PoE electric power transmission circuit 63through input switch unit 57, output switch unit 58, and inner wiring 63z. The electric power that is stepped up in PoE electric powertransmission circuit 63 is supplied to camera 3 along the same pathwayas in the first switching operation.

In this manner, when the third switching operation is performed, any oneof microphone array 5 and camera 3 receives electric power from outer DCpower supply 85 for operation. Furthermore, because instead of PoEapparatus 6, outer DC power supply 85 is already provided for camera 3,the connection between microphone array 5 and camera 3 can be achievedusing one short LAN cable 92 that links microphone LAN connector 52 andcamera LAN connector 31. The electric power supply and the communicationare performed using the one LAN cable 92.

FIG. 8 is a diagram for describing a fourth switching operation. Thefourth switching operation is for the case where PoE apparatus 6 is notpresent, in the same as is the case with the third switching operation.The connection is directly made between microphone LAN connector 51 andnetwork 9 using LAN cable 91. Furthermore, LAN cable 92 that linksmicrophone LAN connector 52 and camera LAN connector 31 can be dedicatedfor the communication.

On the microphone array 5 side, host CPU 55 performs the switching insuch a manner that output terminal 57 z of input switch unit 57 isconnected to input terminal 57 x on the microphone DC connector 53Aside. Furthermore, host CPU 55 switches input terminal 58 z of outputswitch unit 58 to output terminal 58 y on microphone DC connector 53Bside. On the other hand, on the camera 3 side, host CPU 35 switchesoutput terminal 34 z of input switch unit 34 to input terminal 34 x onthe camera DC connector 32 side. A flow of electric power that issupplied is illustrated by a broken line d in the drawing.

In a case where such a fourth switching operation is performed, inmicrophone array 5, electric power is supplied from outer DC powersupply 85 to system electric power supply circuit 62 through microphoneDC connector 53A, input switch unit 57, and inner wiring 62 z.Furthermore, the electric power that is input from microphone DCconnector 53A is supplied to system power supply circuit 39 throughinput switch unit 57, output switch unit 58, microphone DC connector53B, DC cable 93, camera DC connector 32 and input switch unit 34.

Accordingly, when the fourth switching operation is performed, theconnection between microphone array 5 and camera 3 is made using twocables, that is, LAN cable 92 dedicated for the communication and DCcable 93 dedicated for the supply of electric power. Furthermore, cablesare divided into one for the communication and the other for the supplyof electric power, and thus a connection situation is easy to recognize.Furthermore, between microphone array 5 and camera 3, because DCelectric power that is generated within microphone array 5, as is, isreceived in camera 3, conversion loss that results from PoE electricpower transmission circuit 63 performing the step-up and transmittingelectric power using LAN cable 92 does not occur.

As described above, in the monitoring system according to the firstembodiment, in a case where camera 3 is already attached to the ceiling,later, microphone array 5 can be additionally installed, using LAN cable91, as is, that is laid up to camera 3. Therefore, when later,microphone array 5 is additionally installed, work for wiring, such asadding new LAN cable, is not needed.

For example, in the case of the first switching operation, theconnection is achieved by adding only one short LAN cable 92. In thecase of the second switching operation, the connection is achieved bytwo cables, that is, short LAN cable 92 and short DC cable 93. In thecase of the third switching operation, because instead of PoE apparatus6, outer DC power supply 85 is already provided for camera 3, when anoutput destination of outer DC power supply 85 is only changed from thecamera to the microphone, the connection is achieved, by adding only oneshort LAN cable 92, as is the case with the first switching operation.In the case of the fourth switching operation, because when comparedwith the third switching operation, the supply of electric power isperformed using DC cable 93 that links microphone DC connector 53B andcamera DC connector 32 without using LAN cable 92, the connection isachieved by adding only two cables, that is, short LAN cable 92 andshort DC cable 93, as is the case with the second switching operation.

In this manner, an amount of work for installation when setting up themonitoring system by combining the microphone array and the camera canbe reduced. More precisely, later, a microphone array can beadditionally installed, using the wiring, as is, such as the LAN cablethat is already used for the existing camera. Therefore, work forinstallation for additional wiring is unnecessary, and thus an amount ofwork for installation can be reduced. Furthermore, a length of a LANcable that is to be added can be shortened. Furthermore, because thenumber of LAN cables is small and the length of the LAN cable isshortened, even in the case of work for new installation of a monitoringsystem, an amount of work for installation can be decreased.

Furthermore, because LAN cable 92 that is added is accommodated intospace 16 that is surrounded by the inner side of opening 5 r that isformed in case 5 z of microphone array 5, the rear side of case 3 z ofcamera 3, and ceiling 18, LAN cable 92 is not seen from the outside, anda beautiful sight of monitoring system 10 can be prevented from beingspoiled by LAN cable 92. Furthermore, camera 3 can be attached in such amanner that the length of LAN cable 92 that is added is set to be longand thus the operator connects camera 3 and microphone array 5personally, using LAN cable 92 and causes LAN cable 92 to beaccommodated into space 16 in the rear side of camera 3. As a result,efficiency of work is increased.

Furthermore, case 5 z of microphone array 5 is attached to ceilingattachment metal fitting 7 in such a manner that case 5 z of microphonearray 5 externally fits case 3 z of camera 3, and thus it is possiblethat a plurality of microphone units 65 of microphone array 5 and imagecapturing unit 40 of camera 3 are arranged to be aligned with a surfaceparallel to the surface of the ceiling.

In the embodiment described above, the function (network interface unit56) of causing the network to ramify, or the function (PoE electricpower transmission circuit 63 or the like) of distributing a powerresource is provided to the microphone array side, but these functionsmay be provided to the camera side, in which case the same monitoringsystem can be realized. This is the same for the following embodiments.

Second Embodiment

In the first embodiment described above, the case is described wheremicrophone array 5 that has ring-type case 5 z is attached to ceilingattachment metal fitting 7 in such a manner as to externally fit camera3 that has disc-shaped case 3 z, but in a second embodiment, a case isdescribed where the microphone array is installed on the ceiling in astate of being positioned a distance away from the camera.

FIGS. 9A and 9B are diagrams illustrating a configuration of monitoringsystem 10A according to the second embodiment. FIG. 9A is a diagramillustrating an installed state of camera 3A and microphone array 5Athat are installed on ceiling 18. FIG. 9B is a perspective diagramillustrating an external appearance of microphone array 5A.

Monitoring system 10A according to the second embodiment has almost thesame configuration as monitoring system 10 according to the firstembodiment. Constituent elements that are the same as those in the firstembodiment, which are described above, are given the same referencenumerals, and descriptions thereof are omitted.

Ceiling attachment metal fitting 7A for attaching camera 3A is fixed byceiling fixation pint 44A to ceiling 18. Furthermore, ceiling attachmentmetal fitting 7B for attaching microphone array 5A is fixed by ceilingfixation pin 44B to ceiling 18 in a state of being positioned a distanceaway from ceiling attachment metal fitting 7A.

Ceiling attachment metal fitting 7A has almost the same structure as aninner portion of ceiling attachment metal fitting 7 that is described inthe first embodiment. Therefore, fixation pin 33 that protrudes from thebottom surface of camera 3A is inserted into engagement hole 71 that isformed in ceiling attachment metal fitting 7A, and the camera 3A istwisted and thus is fixed to ceiling attachment metal fitting 7A.

Furthermore, ceiling attachment metal fitting 7B has almost the samestructure as an outer portion of ceiling attachment metal fitting 7 thatis described in the first embodiment. Therefore, fixation pin 54 thatprotrudes from the bottom surface of microphone array 5A is insertedinto engagement hole 73 that is informed in ceiling attachment metalfitting 7B, and microphone array 5A is twisted and thus is fixed toceiling attachment metal fitting 7B.

Furthermore, because microphone array 5A has ring-type case 5 y, cover79 is embedded into case 5 y in such a manner as to cover opening 5 rthat is formed in case 5 y. Cover 79 has fixation pin 79 z as is thecase with case 5 y of microphone array 5A, fixation pin 79 z is insertedinto engagement hole 72 that is formed in locking piece 5 t whichprotrudes as one portion of microphone array 5A, cover 79 is twisted,and thus is fixed to ceiling attachment metal fitting 7B.

Cable outlet (notch) 5 x for pulling out LAN cable 92A is formed in aside surface of case 5 y of microphone array 5A. In the same manner,cable outlet 3 x for pulling out LAN cable 92A is formed in a sidesurface of case 3 y of camera 3A. LAN cable 92A is laid in such a mannerthat LAN cable 92A passes through cable outlet 5 x and cable outlet 3 xand links microphone LAN connector 52 which is provided into opening 5 rin microphone array 5A and camera LAN connector 31 which is provided ona rear side of camera 3A. Furthermore, moor 98 that accommodates LANcable 92A is attached to ceiling 18. LAN cable 92A that is laid on thefront side of ceiling 18 is hidden by being accommodated into moor 98.Thus, LAN cable 92A is not seen from the outside. Furthermore, LAN cable92A is laid in a straight line, and thus can be installed to have theshortest length.

In this manner, in the second embodiment, one end side of LAN cable 92Ais accommodated into opening 5 r that is formed in case 5 y ofmicrophone array 5A, and LAN cable 92A is pulled out of cable outlet 5 xthat is formed in case 5 y. Furthermore, the other end side of LAN cable92A is accommodated on a rear side of case 3 y of camera 3A, and LANcable 92A is pulled out of cable outlet 3 x that is formed in case 3 y.

Accordingly, in the case of the later work for adding a microphone arrayto the existing camera, later, a microphone array can be additionallyinstalled, using the wiring, as is, such as the LAN cable that isalready used for the existing camera, as is the case with the firstembodiment describe above. Therefore, the work for installation foradditional wiring is unnecessary, such as installing one more long LANcable, and thus the amount of work for installation can be reduced.

Furthermore, LAN cable 92A that links microphone LAN connector 52 thatis provided into opening 5 r in microphone array 5A, and camera LANconnector 31 that is provided on the rear side of camera 3A can extendalong the surface of ceiling 18 through cable outlet 5 x and cableoutlet 3 x, thereby improving the work efficiency of extension.

Furthermore, in a case where microphone array 5A and camera 3A areinstalled on the surface of ceiling 18, the degree of freedom forinstallation is increased. More precisely, microphone array 5A may notbe installed in such a manner that microphone array 5A and camera 3Ahave the same central axis e, and it is possible that monitoring system10A suitable for a place that is a monitoring target is installed.

Third Embodiment

In the first and second embodiments describe above, the monitoringsystem that is configured with the microphone array and the camera isinstalled on the surface of the ceiling, but in a third embodiment, aceiling-embedded monitoring system is described.

FIG. 10A is a cross-sectional diagram illustrating an attachmentstructure of ceiling-embedded monitoring system 10B according to thethird embodiment. Concrete ceiling building frame 19, which is oneportion of a building, is provided on the rear side of ceiling 18.Fixation bolt 19 z is fixed in such a manner that fixation bolt 19 zprotrudes from ceiling building frame 19 toward the ceiling 18 side. Aswill be described below, plate-shaped protrusion member 7 m that is oneportion of attachment metal fitting 7C is inserted between a pair ofnuts 19 y that are engaged with fixation bolt 19 z, and thus attachmentmetal fitting 7C is fixed by fixation bolt 19 z in a suspended state.

FIG. 11 is a perspective diagram illustrating the rear side ofmonitoring system 10B that is embedded into ceiling 18. Monitoringsystem 10B is attached to attachment metal fitting 7C. Attachment metalfitting 7C is made up of plate 7 n that has an approximately roundshape, plate-shaped protrusion member 7 m that is bent in such a manneras to protrude from a surface of plate 7 n, and support member 7 p thatwill be described below.

Locking piece 7 z for fixing camera 3B, which protrudes in the directionof central axis e, is formed at concentric three points on a surface ofplate 7 n, as is the case with the first embodiment described above.Additionally, locking piece 7 y 1 for fixing microphone array 5B, whichis one step lower than locking piece 7 z and protrudes in the directionof central axis e, is formed at four points on the surface of plate 7 n,which are in the cross direction on a concentric circle that is largerthan the concentric circle on which locking pieces 7 z are formed, as isthe case with the first embodiment described above.

Engagement hole 71 z that is engaged with fixation pin 33 which isprovided on the bottom surface of camera 3B is formed in locking piece 7z, as is the case with the first embodiment described above. In the samemanner, engagement hole 73 z that is engaged with fixation pin 54A thatis provided on the bottom surface of microphone array 5B is formed inlocking piece 7 y 1.

Furthermore, opening 7 u is formed in plate 7 n in such a manner thatcamera LAN connector 31 is exposed. Additionally, opening 7 h is formedin plate 7 n in such a manner that microphone LAN connectors 51A and 52Athat are arranged on a rear surface of case 5 z 1 of microphone array 5Bare exposed.

Additionally, approximately shaped vertical member 7 q that is oneportion of support member 7 p is formed at four points on the rim ofplate 7 n that has an approximately round shape, which are in the crossdirection, in such a manner that vertical member 7 q is bent. FIG. 10Bis a cross-sectional diagram illustrating a structure of support member7 p.

Support member 7 p has bolt 7 p 1 that passes through a hole that isformed in each of protrusion portions 7 q 1 and 7 q 2 of vertical member7 q, and C ring 7 p 3 for preventing locking of bolt 7 p 1 thatprotrudes from the hole which is formed in protrusion portion 7 q 2. Thehole that is formed in each of protrusion portions 7 q 1 and 7 q 2 is athrough hole (a so-called clearance hole) through which bolt 7 p 1passes. Furthermore, support member 7 p has pressing plate 7 p 5 forinserting ceiling plate 18 z, into which bolt 7 p 1 is screwed, andreception ring 7 p 6.

Pressing plate 7 p 5 is an approximately C-shaped member that resultsfrom connecting internally threaded upper plate 7 p 51, into which bolt7 p 1 is screwed, internally threaded lower plate 7 p 52, into whichbolt 7 p 1 is screwed, and pressure plate 7 p 53 that presses ceilingplate 18 z along with protrusion portion 7 q 2 as a reception ring.Instead of lower plate 7 p 52 being internally threaded, a through holethrough which bolt 7 p 1 passes may be formed in lower plate 7 p 52.

In support member 7 p that has this structure, with ceiling plate 18 zbeing interposed between protrusion portion 7 q 2 as the reception ringand pressure plate 7 p 53, a head portion of bolt 7 p 1 is rotated inthe direction of an arrow i, and thus pressing plate 7 p 5 into whichbolt 7 p 1 is screwed is moved to the reception ring 7 p 6 side.Accordingly, attachment metal fitting 7C including support member 7 p isfixed to ceiling plate 18 z that is interposed between protrusionportion 7 q 2 as the reception ring and pressure plate 7 p 53.

In this manner, attachment metal fitting 7C is fixed to ceiling buildingframe 19 by fixation bolt 19 z by which protrusion member 7 m isinserted, in a suspended state, and is fixed to ceiling plate 18 z bysupport member 7 p in an inserted state.

In the case of the ceiling-embedded monitoring system 10B, unlike in thefirst and second embodiments described above, case 5 z 1 of microphonearray 5B has approximately the same thickness as case 3 z of camera 3B.That is, because microphone LAN connectors 51A and 51B are arranged on arear side of case 5 z 1 of microphone array 5B, opening 5 r thatproduces space 16 in such a manner that LAN cable 92 is accommodateddoes not even need to be formed. Because of this, the thickness of case5 z 1 can be decreased.

The attachment of case 5 z 1 of microphone array 5B and case 3 z ofcamera 3B to attachment metal fitting 7C is the same as that in thefirst embodiment described above. That is, fixation pin 33 thatprotrudes from the bottom surface of camera 3B is inserted intoengagement hole 71 z that is formed in attachment metal fitting 7C, andthe camera 3B is twisted and thus is fixed to attachment metal fitting7C. Furthermore, when it comes to microphone array 5B, fixation pin 54Ais inserted into engagement hole 73 z that is formed in locking piece 7y 1, and microphone array 5B is twisted and thus is fixed to attachmentmetal fitting 7C.

Monitoring system 10B that is attached to attachment metal fitting 7C isembedded into ceiling 18 securely in a stable state without wobbling.

In this manner, in monitoring system 10B according to the thirdembodiment, as is the case with the first embodiment described above,later, a microphone array can be additionally installed, using thewiring, as is, such as the LAN cable that is already used for theexisting camera. Therefore, the work for installation for additionalwiring is unnecessary, and thus an amount of work for installation canbe reduced.

Furthermore, in the case of ceiling-embedded microphone array 5B,because microphone LAN connectors 51A and 52A are arranged on a rearsurface of case 5 z, opening 5 r that produces space 16 thataccommodates LAN cable 92 does not even need to be formed behind case 5z 1 of ceiling-embedded microphone array 5B. Because of this, thethickness of microphone array 5B can be decreased, and miniaturizationof microphone array 5B is achieved.

Furthermore, because a portion of ceiling 18, from which monitoringsystem 10B is exposed, is decreased, a beautiful sight of ceiling 18 isnot spoiled and ceiling 18 looks nice. Furthermore, mental pressure toinstall monitoring system 10B is reduced.

Fourth Embodiment

In the first, second, and third embodiments described, the monitoringsystem that results from combining the omnidirectional camera and themicrophone array are described, but a monitoring system according to afourth embodiment represents a case where a PTZ camera and themicrophone array are combined.

FIGS. 12A and 12B are diagrams, each illustrating a configuration ofmonitoring system 10C according to the fourth embodiment. FIG. 12A is adiagram illustrating an installed state of monitoring system 10C that isinstalled on ceiling 18 according to the fourth embodiment. Camera 3Chas pole 3 j that is fixed to ceiling 18, camera case 3 u that isconnected to pole 3 j, and drive unit 3 i that turns camera case 3 ufreely, and is a PTZ camera that is freely moved in the panning ortilting direction according to the image capture range and freely zoomsin and out. (enlargement and reduction). Camera LAN connector 31 g isprovided on a rear surface of camera case 3 u.

Furthermore, microphone array 5C is installed on ceiling 18 in a stateof being positioned a short distance away from camera 3C. FIG. 12B is adiagram illustrating an external appearance of microphone array 5Caccording to the fourth embodiment. Microphone LAN connectors 51C and52C are arranged adjacent to each other on a side surface of case 5 w ofmicrophone array 5C. Furthermore, a plurality of microphone units 65 (8microphone units, here) are concentrically arranged on a front surface(a surface that is opposite to the ceiling) of microphone array 5C. LANcable 92 e is connected between camera LAN connector 31 g and microphoneLAN connector 52C.

In monitoring system 10C according to the fourth embodiment, in the caseof the later work for adding a microphone array to the existing camera,later, a microphone array can be additionally installed, using thewiring, as is, such as the LAN cable that is already used for theexisting camera, as is the case with the first embodiment describeabove. Therefore, the work for installation for additional wiring isunnecessary, and thus an amount of work for installation can be reduced.

Furthermore, because microphone LAN connectors 51C and 52C are arrangedadjacent to each other on the side surface of case 5 w of microphonearray 5C, in a case where LAN cable 92 e is laid in such a manner thatLAN cable 92 e extends along ceiling 18 without passing through a spacebetween the roof and the ceiling, the length of LAN cable can bedecreased. Thus, the work efficiency of laying the LAN cable can beimproved.

Furthermore, the PTZ camera and the microphone array are combined, andthus the monitoring can be performed in a state where image data that isimage-captured by the PTZ camera in a prescribed direction and voicedata that is sound-picked up by the microphone array are associated.

At this point, as the monitoring system, the case of the combinationwith the PTZ camera is described, but combination with a fixation camerain which an image capture range is fixed in a specific direction may bepossible.

Fifth Embodiment

FIG. 15 is a block diagram illustrating a configuration of monitoringsystem 205 according to a fifth embodiment. Monitoring system 205, forexample, has a configuration in which a plurality of camera apparatusesC11 to C1 n for monitoring, which are installed in a store such as aconvenience store, microphone array apparatus MA, recorder apparatus240, and Personal Computer (PC) 230 are connected to one another throughnetwork 215.

Each of camera apparatuses C11 to C1 n is a fixation camera that has afixed view angle, and image-captures a picture (which is hereinafterdefined as including a still image and a moving image) of the vicinityof a predetermined position that is an image capture target area. n is apositive value that is equivalent to an identification number of thecamera apparatus. Because camera apparatuses C11 to C1 n are differentonly in image capture target area from each other and any one has thesame configuration, a representative example, a configuration andoperation of camera apparatus C11 are described. Furthermore, in a casewhere a camera apparatus that is different in specification from thecamera apparatus C11, such a camera apparatus is described on eachoccasion. The camera apparatus C11 transfers data (picture data) on apicture that is image-captured to PC 230 through network 215, andrecords the data in recorder apparatus 240. Cameras C11 to C1 n, forexample, may employ the same configuration as any one of cameras 3, 3A,3B, and 3C according to the first to fourth embodiments.

Microphone array apparatus MA is installed, for example, on a ceilingwithin store 210 (refer to FIG. 19), and a plurality of microphones M1to Mn (refer to FIG. 17) (for example, eight microphones) are arrangedon a concentric circle to face downward. Thus, it is possible that voicewithin the store is sound-picked up. Microphone array apparatus MAsound-picks up voice in the vicinity of the image capture target areausing each of microphones M1 to Mn, transmits the data (voice data) onthe voice that is sound-picked up by each of microphones M1 to Mn, to PC230 through network 215, and records the data on the voice in recorderapparatus 240. Each of microphones M1 to Mn may be a non-directionalmicrophone, a bidirectional microphone, a unidirectional microphone, ora sharply directional microphone. Microphone array apparatus MA, forexample, may employ the same configuration as any one of microphonearrays 5, 5A, 5B, and 5C according to the first to fourth embodiments.

Recorder apparatus 240 is configured to include a control unit (notillustrated) for controlling processing, such as recording data, and arecording unit (not illustrated) for storing picture data and the voicedata. Recorder apparatus 240 records the picture data that isimage-captured by each of camera apparatuses C11 to C1 n, and the voicedata that is sound-picked up by microphone array apparatus MA, in astate of being associated with each other.

PC 230 has a configuration in which the picture that is image-capturedby each of camera apparatus C11 to C1 n and the voice that issound-picked up in microphone array apparatus MA are monitored, and inwhich voice processing apparatus 250 and picture processing apparatus270 are included.

FIG. 16 is a block diagram illustrating a configuration of voiceprocessing apparatus 250. voice processing apparatus 250 has signalprocessing unit 251, memory 255, communication unit 256, operation unit257, display 258, and speaker 259. Through network 215, communicationunit 256 receives packet PKT (refer to FIG. 18) that is transmittedmicrophone array apparatus MA or recorder apparatus 240, outputs thereceived packet PKT to signal processing unit 251, and transmits presetinformation (refer to FIG. 23) that is generated in signal processingunit 251, to microphone array apparatus MA. Memory 255 is configuredwith, for example, a Random Access Memory (RAM), and functions as a workmemory at the time of operation of each unit of voice processingapparatus 250. Data that is necessary at the time of the operation ofeach unit of voice processing apparatus 250 is stored in memory 255.

Signal processing unit 251 is configured with, for example, a CentralProcessing Unit (CPU), a Micro Processing Unit (MPU), or a DigitalSignal Processor (DSP), has sound source direction detection unit 252,directionality formation unit 253, and input and output control unit254, and performs control processing for generally managing operation ofeach unit of PC 230, input and output processing of data between eachunit, data arithmetic operation (calculation) processing, and datastoring processing.

sound source direction detection unit 252 estimates in which direction asound source from which the voice that is sound-picked up in microphonearray apparatus MA comes is positioned. In the present embodiment, thedirection of the sound source is assumed to be the center of microphonearray apparatus MA, is expressed as a horizontal angle of θ and avertical angle of φ (refer to FIG. 20), and is estimated, for example,using a sound volume. The horizontal angle of θ is an angle within ahorizontal plane (a X-Y plane) in an actual space of which an origin isassumed to the center of microphone array apparatus MA, and the verticalangle φ is an inclination with respect to the Z-axis that passes throughthe center of microphone array apparatus MA. For example, in a casewhere the sound source is below microphone array apparatus MA, thevertical angle of φ is detected as a small value.

Furthermore, in a case where, in order to perform preset processing thatwill be described below, voice is caused to be generated, a method ofnot only simply specifying a sound volume; but also specifying a soundgeneration place (the direction of the sound source) from a soundcharacteristic is effective. When it comes to the method of causingsound to have its own characteristic, for example, a sinusoidal wave ofwhich a frequency is fixed, a sinusoidal wave of which a frequencychanges with a fixed periodicity, white noise that switches between ONand OFF with a fixed periodicity, a registration-completed utteredsentence, and the like are output from speaker 259, and thus the soundthat has a characteristic is obtained. Accordingly, even in a case wherethe preset processing is performed at a noisy place (for example; insideof a room where interior finish work is in progress, or in a shoppingcenter), sound source direction detection unit 252 can specify soundthat is caused to have a characteristic, among pieces of voice that aresound-picked up in microphone array apparatus MA. Furthermore, even in acase where the preset processing is performed at silent night and voiceis caused to be generated, although a sound volume is small, soundsource direction detection unit 252 can specify sound that is caused tohave a characteristic, among pieces of voice that are sound-picked up inmicrophone array apparatus MA.

Furthermore, as the method of specifying the direction of the soundsource that causes, sound that is caused to have a characteristic, to begenerated, there are two methods that follow. As a first method, soundsource direction detection unit 252 performs sound characteristicanalysis sequentially on pieces of voice that are sound-picked up inmicrophone array apparatus MA, starting from a place from which a highsound volume comes, and determines that a direction of sound which isconsistent in characteristic is the direction of the sound source. As asecond method, sound source direction detection unit 252 divides theimage capture target area into areas, searches each area (which is alsoreferred to a division area) that results from the division, for a soundcharacteristic, and determines that a direction of the division areawhich is consistent in characteristic is the direction of the soundsource. As the second method, because sound source direction detectionunit 252 takes in the voice data at a fixed sound pickup time, and thensearches for the sound characteristic, this has no relation with thesound volume, and a low sound volume is sufficient. Therefore, withoutbothering people in the neighborhood, even in a noisy place, sound thatis caused to have a characteristic can be specified. As a third method,it is also possible that it is determined that there is a crosscorrelation with a generation signal. As will be described below, at thetime of the preset processing, as examples of voice that isvoice-generated from the sound source, voice that is generated by ahuman being, sound that is generated by pressing a buzzer, sound that isoutput from a speaker, and the like are given.

Directionality formation unit 253 adds up pieces of voice data that aresound-picked up by microphones M1 to Mn by performing directionalitycontrol processing of voice data, using the voice data that istransferred directly from microphone array apparatus MA or the voicedata that is recorded in recorder apparatus 240, and generates voicedata that forms the directionality to a specific direction, in order toemphasize (amplify) voice (a sound volume level) from a position of eachof microphones M1 to Mn of microphone array apparatus MA to a specificdirection. The specific direction is a direction (which is also referredto as a pointing direction) from microphone array apparatus MA toward aposition in a real space, which corresponds to a position that isdesignated in operation unit 257. A technology relating to thedirectionality control processing of the voice data for forming thedirection of the voice that is sound-picked up by microphone arrayapparatus MA is a known technology as disclosed, for example, inJapanese Patent Unexamined Publication No. 2014-143678 (PTL 1 describedabove) and the like.

Input and output control unit 254 controls input and output of variouspieces of data from operation unit 257 and to display 258 and speaker259. Operation unit 257, for example, is configured with a touch panelor a touch pad which is positioned in a manner that corresponds to ascreen of display 258, and on which an input operation by a user or astylus pen is capable of being performed. In response to a user'soperation, operation unit 257 outputs pieces of data on one or moredesignation points (coordinates) where an emphasis on (amplification of)the sound volume level of the voice data is desirable, to signalprocessing unit 251. Operation unit 257 may be configured with apointing device, such as a mouse or a keyboard.

Voice map 265 indicating a position of the sound source that isestimated in sound source direction detection unit 252 is displayed ondisplay 258. Speaker 259 outputs the voice data that is sound-picked upby microphone array apparatus MA and is transferred through network 215or the voice data that is recorded in recorder apparatus 240, or voicedata on which processing for emphasis in a specific direction bydirectionality formation unit 253 is performed based on the voice datathat is sound-picked and transferred.

On the other hand, picture processing apparatus 270 operatesindependently without operating in conjunction with voice processingapparatus 250, and, according to a user's operational instruction,performs control that display the picture data which is image-capturedby each of fixation camera apparatuses C11 to C1 n, on display 258. Moreprecisely, when the user selects a camera apparatus to be used, pictureprocessing apparatus 270 displays a picture that is captured by theselected camera apparatus on equipped camera monitor 271.

FIG. 17 is a block diagram illustrating microphone array apparatus MA.Microphone array apparatus MA sound-picks up voice in all directions (atan angle of 360 degrees), and has a plurality of microphone units (whichare referred to as a microphone for short) M1 to Mn (n=8), a pluralityof amplifier (amp) PA1 to PAn that amplify output signals, respectively,of the plurality of microphone units M1 to Mn, a plurality of A/Dconverters A1 to An that convert analog signals that are output from theplurality amplifiers PA1 to PAn, into digital signals, respectively,coding unit 225, storage unit 224, and transmission unit 226.

The preset information, which indicates a correspondence relationshipbetween each of predetermined positions (preset positions P1 to Pn) thatare image-captured by camera apparatuses C11 to C1 n and the pointingdirection (which, specifically, is a set of the horizontal angle of θand the vertical angle of φ) from microphone array apparatus MA, isstored in storage unit 224. Coding unit 225 adds the preset informationthat is stored in storage unit 224, to digital voice signals that areoutput from A/D converters A1 to An, and generates packet PKT of thevoice data. Transmission unit 226 transmits packet PKT of the voice datathat is generated in coding unit 225, to voice processing apparatus 250through network 215.

In this manner, microphone array apparatus MA amplifies the outputssignals of microphones M1 to Mn in amplifiers PA1 to PAn, respectively,and convert the resulting signals into the digital voice signals in A/Dconverters A1 to An, respectively. Thereafter, the preset informationthat is stored in storage unit 224 is added to the digital voice signal,the packet PKT of the voice data is generated, and the packet PKT of thevoice data is transmitted to voice processing apparatus 250 within PC230 through network 215.

FIG. 18 is a diagram illustrating a structure of packet. PKT of thevoice data that is transmitted from microphone array apparatus MA tovoice processing apparatus 250. Packet PKT of the voice data isconfigured with a header and the voice data that is a payload. Thepreset information described above is included in the header.

In the present embodiment, the preset information is included in packetPKT of the voice data, and thus the preset information is transmittedfrom microphone array apparatus MA to voice processing apparatus 250,but voice processing apparatus 250 may acquire the preset informationusing other methods. For example, when voice processing apparatus 250reads initial information of microphone array apparatus MA, the presetinformation may be read together. Furthermore, microphone arrayapparatus MA may transmit the preset information in response to arequest from voice processing apparatus 250.

FIG. 19 is a diagram illustrating a layout of the inside of store 210where monitoring system 205 is installed. As one example, “Doorway,”“Register R1,” “Register R2,” “three rows of Commodity Shelves,” “LunchShelf,” “Daily Dish Shelf,” “Drink Shelf,” and “Magazine Shelf” arearranged in store 210, such as a convenience store. Microphone arrayapparatus MA is installed on a ceiling of store 210, and a plurality ofcamera apparatuses C11 to C1 n (n=4, here) are installed on an upperportion of a wall or the ceiling of store 210.

In FIG. 19, the plurality of camera apparatuses C11 to C1 n faces towarda plurality of preset positions P1 to Pn that are installed in advancewithin store 210, respectively, in such a manner as to image-capture theplurality of preset positions P1 to Pn. The plurality of presetpositions P1 to Pn are decided by the user as the image capture targetarea that is monitored. Image capture ranges of pictures that areimage-captured by camera apparatuses C11 to C1 n are expressed as CR1 toCRn, respectively, and preset positions P1 to Pn are present atapproximately the centers of image capture ranges CR1 to CRn,respectively.

Camera apparatuses C11 and C12 image-capture pictures, respectively, ofpreset position P1 of “Register R1” that is a monitoring target area (asound pickup area) and preset position P2 of “Register R2” that is amonitoring target area. Camera apparatus C13 image-captures a picture ofpreset information P3 of “Magazine Shelf” that is a sound pickup area.In this manner, in a case where microphone array apparatus MA isattached later within store 210 where camera apparatuses C11 to C14 arealready installed, the preset processing that will be described below isperformed.

Operation of monitoring system 205 with the configuration describedabove is described. First, the preset processing that is performedbefore monitoring system 205 starts to run (to perform monitoring) willbe described. At this point, the preset processing is processing thatsets the direction (more precisely, the horizontal angle of θ and thevertical angle of φ with respect to the center of microphone arrayapparatus MA) from microphone array apparatus MA toward each ofpredetermined positions (preset positions) P1 to Pn within store 210.FIG. 20 is a diagram for describing an outline of the preset processing.Microphone array apparatus NIA and camera apparatuses C11 to C1 n areinstalled on ceiling RF of store 210. Camera apparatuses C11 to C1 nthat are fixation cameras face toward preset positions P1 to Pn,respectively, and pictures that are mage-captured by camera apparatusesC11 to C1 n, respectively, are displayed on camera monitor 271, withwhich picture processing apparatus 270 is equipped. In work thataccompanies the preset processing, a sound source (sound generationsource) is placed in preset positions P1 to Pn. As examples of the soundsource, as described above, for example, voice that is generated by ahuman being, sound that is generated by pressing a buzzer, sound that isoutput from a speaker, and the like are given. In FIG. 20, a case wherea speaker 281 who stands on floor FLR generates voice at positions P1 toPn is illustrated. Microphone array apparatus MA, when sound-picking upvoice, transmits voice data on the voice to voice processing apparatus250. Voice processing apparatus 250 displays a sound generation sourceposition of the voice that is sound-picked up, as a sound source mark(marker) SD, on the screen (voice map 265 that will be described below)of display 258.

Display 258, which is included in voice processing apparatus 250, may besubstituted for camera monitor 271. Furthermore, in a case where voiceprocessing apparatus 250 and picture processing apparatus 270 areintegrated into one piece as a monitoring apparatus, a screen (a window)of a display is switched, and thus it is possible that the display isused as a camera monitor and it is also possible that both are displayedat the same time using divisional display.

FIG. 21 is a flowchart illustrating a sound pickup procedure at the timeof the preset processing and the monitoring. Microphone array apparatusMA is attached to the ceiling of store 210, and then the presetprocessing is performed. First, voice processing apparatus 250 performsinitial setting of the microphone array apparatus MA (S31). In theinitial setting, voice processing apparatus 250 sets an IP address ofthe microphone array apparatus MA, and microphone array apparatus MA isset to be in a state where communication is possible. Additionally,voice processing apparatus 250 enters a preset mode, and causes voicemap 265 (refer to FIG. 22) to be displayed on display 258.

When the initial setting of microphone array apparatus MA is finished,the sound source (the sound generation source) is placed in presetpositions P1 to Pn toward which camera apparatuses C11 to C1 n face,respectively, and the sound source generates sound in a prescribed soundvolume or higher for a prescribed time (S32). At this point, the speaker281 as the sound source generates voice in preset positions P1 to Pn.Microphone array apparatus MA sound-picks up the voice, and transmit theresulting voice data to voice processing apparatus 250.

Communication unit 256 within voice processing apparatus 250 receivesthe voice data that is transmitted from microphone array apparatus MA(S33). Sound source direction detection unit 252 within voice processingapparatus 250 obtains the pointing direction (the horizontal angle of θand the vertical angle of φ) from microphone array apparatus MA towardthe sound source, based on the sound volume of the received voice data,and displays sound source mark SD1 indicating the sound generationsource position, on voice map 265 that is displayed on display 258(S34). FIG. 22 is a diagram illustrating the screen of display 258 onwhich voice map 265 is displayed at the time of the preset processing.

Voice map 265 is drawn as three concentric circles 265 h, 265 i, and 265j, of which central points O are a position of microphone arrayapparatus MA, and line segment 265 m that is a radius that dividescentral angles of these concentric circles into twelve portions. Amongthree concentric circles, the innermost concentric circle is 265 h isequivalent to the vertical angle φ=30°, the middle concentric circle 265i is equivalent to the vertical angel φ=60°, and the outermostconcentric circle 265 j is equivalent to the vertical angle φ=90°.Therefore, the more inward sound source mark SD1 is present, the moremicrophone array apparatus MA is approached. Line segment 265 m thatextends in the horizontal direction to the right side of central point Ois equivalent to the central angle=0° and the horizontal angle=0°. Linesegments 265 m that result from the division into twelve portionsindicate horizontal angles 0° to 360° with the central angle is startedwith the central angle=0° and incremented by 30°. At this point, soundsource mark SD1 is drawn at coordinates (240°, 70°), as coordinates (θ,φ) on voice map 265. Because sound source mark SD1 is not yet confirmedas the sound source in the preset position, sound source mark SD1 isdrawn as a rectangle. In FIG. 22, angles of 30°, 60°, and the like areattached, but may not be displayed. Furthermore, graduations may bedisplayed, and for example, the vertical angle may be displayed by 15°.

The user (the speaker) selects sound source mark SD1 that is displayedon display 258, using cursor 287, and inputs information (camerainformation) of camera apparatus C11 that corresponds to sound sourcemark SD1 (S35). When sound source mark SD1 is selected, input box 288for the camera information is displayed on a lower right corner of thescreen of display 258. A place (for example, Register R1) and a cameraIP address are capable of being input by the user into input box 288 forthe camera information. Instead of the user selecting the sound sourcemark, in a case where a sound volume of voice that is sound-picked up bythe microphone array apparatus is a prescribed threshold or above for aprescribed time, a voice processing apparatus may automaticallyrecognize a direction in which the voice occurs, may cause the soundsource mark to be displayed on the display, and may urge the user toinput the camera information.

Voice processing apparatus 250 reads the horizontal angle θ and thevertical angle φ of sound source mark SD1 (S36), and transmits thecamera information that is input in Step S35 and the horizontal value θand the vertical angle φ of sound source mark SD1 to microphone arrayapparatus MA (S37). Microphone array apparatus MA lists the presetinformation, which is transmitted from voice processing apparatus 250,in preset information table 290 (refer to FIG. 23) for storing instorage unit 224.

FIG. 23 is a diagram illustrating a detail of listing in presetinformation table 290 that is stored in microphone array apparatus MA.The camera IP address, the place, and the pointing direction (thehorizontal angle θ and the vertical angle φ) are listed in presetinformation table 290 for each of preset positions P1 to Pn that areincluded in the image capture target areas, respectively, for the cameraapparatuses C11 to C1 n.

The user determines whether or not setting processing of the presetposition is ended (S38). In a case where the setting processing of allpieces of preset positions are not ended, the preset processing returnsto Step S32 and the same processing is repeated.

On the other hand, when the preset processing is ended for the presetpositions, the preset processing before starting the running iscompleted, the running (actual monitoring processing) is started. Voiceprocessing apparatus 250 acquires the voice data from microphone arrayapparatus MA, and displays all preset positions, which are extractedfrom the preset information that is included in the header of packet PKTof the voice data, on display 258 (S39). FIG. 24 is a diagramillustrating the screen of display 258 that is displayed after thepreset processing. Sound source maps SD1, SD2, SD3, and SD4, which areconfirmed as the directions (the pointing directions) of the soundsources, respectively, that are “Register R1,”, “Register R2,” “MagazineShelf,” “Food Shelf” that are the preset positions, are drawn in theform of a circle on voice map 265 that is displayed on the screen ofdisplay 258. Particularly, in a case where there is no need todistinguish between sound source marks SD1 to SD4, sound source marksSD1 to SD4 are simply collectively referred to as a sound source markSD. Furthermore, camera information 267 is displayed on a lower rightcorner of the screen of display 258. Pieces of preset information andcamera IP addresses, which correspond to camera apparatuses C11 to C1 n,respectively, are included in camera information 267.

When the preset position that is displayed on display 258 is designatedby the user through operation unit 257 (S40), voice processing apparatus250 reads the horizontal angle θ and the vertical angle φ that indicatethe pointing direction of the designated preset position (S41).Directionality formation unit 253 within voice processing apparatus 250forms the directionality of the voice data in the pointing direction ofthe preset position that is specified by the horizontal angle θ and thevertical angle φ which are read, and outputs voice from speaker 259(S42).

Furthermore, when the user designates other one position that isdisplayed on display 258 while reproducing the voice that issound-picked up at the time of the running (more precisely, actualmonitoring) (YES in S43), voice processing apparatus 250 reads thehorizontal angle and the vertical angle from the preset information onthe designated position (S41), and forms the directionality of the voicedata in the pointing direction and outputs the voice from speaker 259(S42).

On the other hand, if there is no designation of a new designationposition (NO in S43), voice processing apparatus 250 continues toperform the reproduction until a power source is turned OFF (S44).Moreover, because according to a user's instruction, the directionalityformation does not continue until the power source is turned OFF,according to user's support, the directionality formation may becancelled and all pieces of sound may be monitored until the nextposition designation occurs.

FIG. 25 is a diagram illustrating the screen of the display 258 that isdisplayed at the time of the monitoring and a sound generation operationby speaker 259. At the time of the running (the monitoring), forexample, when the user selects sound source mark SD3 through operationunit 257, voice processing apparatus 250 forms the directionality of thevoice data in the pointing direction (θ3, φ3) of sound source mark SD3,more precisely, the direction of the magazine shelf (preset positionP3), sound-picks up voice, and outputs the voice from speaker 259. Atthis time, when detecting abnormal sound (sound in a high volume), ofwhich a sound volume exceeds a threshold, in preset position P3 or inthe vicinity thereof, voice processing apparatus 250 alerts the user tothe detection by causing sound source mark SD3 to blink. At this point.the sound source mark is displayed in a manner that the sound sourcemark is identifiable by being blinked, but color, a shape, a size, orthe like may be changed. Furthermore, according to how high a soundvolume of the preset position is, voice processing apparatus 250 maycause sound source mark color to be changed, may cause a size, a shape,or the like of the sound source mark to be changed, or may cause aletter color or a background color of only the involved camerainformation, among pieces of camera information, to be changed.

As described above, in monitoring system 205 according to the fifthembodiment, when microphone array apparatus MA is attached later withinstore 210 where camera apparatuses C11 to C14 are already installed, inthe preset processing, the sound source is placed in the presetpositions P1 to Pn in an optical axis direction that are the centers ofimages which are captured by camera apparatuses C11 to C1 n,respectively, and voice is generated. When the microphone arrayapparatus MA sound-picks up the voice that is output from the soundsource and sends the voice data on the voice to voice processingapparatus 250, sound source direction detection unit 252 causes soundsource marks SD1 to SD4, each of which indicates the pointing direction(the horizontal angle θ, the vertical angle φ), to be displayed ondisplay 258, and urges the user to make a selection among sound sourcemarks SD1 to SD4 and to input the camera information. Voice processingapparatus 250 transmits the camera information that is input, and thepointing direction, to the microphone array apparatus MA. The microphonearray apparatus MA lists the camera information and the pointingdirection in preset information table 290 for storing in storage unit224. At the time of the running, when the user selects any one of soundsource marks SD1 to SD4 on voice map 265 that is displayed on display258, directionality formation unit 253 forms the directionality of thevoice data in the pointing direction (the horizontal angle θ, thevertical angle φ) that corresponds to the sound source map, for thevoice data that is sound-picked up in microphone array apparatus MA, andvoice processing apparatus 250 causes voice to be output from speaker259.

Accordingly, even in a case where a positional relationship between eachof camera apparatuses C11 to C1 n and microphone array apparatus MA isunclear, monitoring system 205 can form the directionality in thepointing direction from microphone array apparatus MA toward apredetermined image capture position (more precisely, preset positionsP1 to Pn), and thus, the voice from the sound source, which issound-generated, can be clearly heard. Therefore, in order to obtain acorrespondence relationship between a coordinate system for the cameraapparatus and a coordinate system for the microphone array apparatus,monitoring system 205 can measure an attachment position, a direction,or the like on the spot, or can associate the camera apparatus and themicrophone array apparatus with each other without the need to performgeometric calculation. Furthermore, voice processing apparatus 250 canobtain the preset information using only microphone array apparatus MA.

Furthermore, at the time of the running (the monitoring), monitoringsystem 205 can form the directionality of the voice data in the pointingdirection (the horizontal angle θ and the vertical angle φ) that isassociated with the preset position based on the preset information, andcan output the voice that is sound-picked up in the preset position,from speaker 259.

Furthermore, voice map 265 that is drawn as a concentric circle of whichthe center is microphone array apparatus MA, of which the center angleindicates the horizontal angle, and a length of whose radius indicates asize of the vertical angle is displayed on display 258, and sound sourcemap SDn is displayed on voice map 265. Because of this, the user caneasily recognize preset information Pn that is indicated on sound sourcemap SDn.

Furthermore, when sound source map SD that is displayed on voice map 265is designated through operation unit 257, because directionalityformation unit 253 forms the directionality of the voice data of thevoice that is sound-picked up in the microphone array apparatus MA, inthe pointing direction that is associated with designated sound sourcemap SD, the user can hear voice that is generated in preset position Pn,with a simple operation.

Furthermore, in a case where a sound volume of the voice that isgenerated in preset position P exceeds a threshold, because sound sourcemap SD thereof is displayed in a blinking manner, display 258 canpromptly alert the user that sound (abnormal sound) that is so strongthat the sound volume exceeds the threshold is detected.

Furthermore, because the preset information is written in the header ofpacket PKT of the voice data, information in the pointing direction ofthe preset position can be obtained only with the voice data.Furthermore, because preset information table 290 is stored in storageunit 224 of microphone array apparatus MA, even in a case where aplurality of microphone arrays are installed, the correspondencerelationship between microphone array apparatus MA and the presetinformation does not even need to be managed.

Sixth Embodiment

In the fifth embodiment, the voice processing apparatus and the pictureprocessing apparatus operates separately of each other, but in the sixthembodiment, a case is described where the voice processing apparatus andthe picture processing apparatus are integrated into one piece and wherethe picture that is image-captured with the camera apparatus and thevoice map are displayed at the same time on the screen of the display.

FIG. 26 is a block diagram illustrating a configuration of monitoringsystem 205A according to the sixth embodiment. The monitoring systemaccording to the sixth embodiment has a configuration that is almost thesame as that according to the fifth embodiment. Constituent elementsthat are the same as those in the fifth embodiment are given the samereference numerals, and descriptions thereof are omitted.

In monitoring system 205A, monitoring apparatus 300 is connected tonetwork 215. Monitoring apparatus 300 has voice processing unit 305,picture processing unit 307, operation unit 317, display 318, andspeaker 319.

Voice processing unit 305 has functions of sound source directiondetection unit 252, directionality formation unit 253, and input andoutput control unit 254, in the same manner as signal processing unit251 within voice processing apparatus 250 according to the fifthembodiment. According to a user's operational instruction, pictureprocessing unit 307 causes the picture data, which is image-captured byeach of fixation camera apparatuses C11 to C1 n, to be displayed ondisplay 318.

Operation unit 317 is installed, for example, in a manner thatcorresponds to the screen of display 318, and is configured with a touchpanel or a touch pad on which an input operation is capable of beingperformed using a user's finger and a stylus. In response to a user'soperation, operation unit 317 outputs pieces of data that arecoordinates of one or more designation points where an emphasis on(amplification of) the sound volume level of the voice data isdesirable, to voice processing unit 305. Operation unit 317 may beconfigured with a pointing device, such as a mouse or a keyboard.

A picture (an image) that is based on the picture data, which isimage-captured by each of camera apparatuses C11 to C1 n and istransferred through network 215, or which is recorded in recorderapparatus 240, is displayed on display 318.

Speaker 319 outputs the voice data that is sound-picked up by microphonearray apparatus MA and is transferred through network 215, or that isrecorded in recorder apparatus 240, or the voice data on which theprocessing for emphasis in a specific direction by voice processing unit305, which is based on the audio data that is sound-picked up andtransferred, is performed.

Operation of monitoring system 205A that has the configuration describedabove is described. FIG. 27 is a flowchart indicating the presetprocessing and the sound pickup procedure at the time of the monitoring.FIG. 28 is a flowchart indicating the preset processing and the soundpickup procedure at the time of the monitoring, which follows theflowchart in FIG. 27. The same step processing as in the fifthembodiment is given the same step number, and a description thereof isomitted.

In Step S31, voice processing unit 305 sets the IP address of themicrophone array apparatus MA, and microphone array apparatus MA is setto be in the state where communication is possible. Additionally, voiceprocessing unit 305 enters the preset mode, and causes voice map 265 tobe displayed on display 318. Picture processing unit 307 performsbroadcast on all camera apparatuses C11 to C1 n that are connected tonetwork 215, and receives responses to the broadcast, and thus searchescamera apparatuses C11 to C1 n that are connected to network 215, fordetection (S31A).

Voice processing unit 305 stores the total number n of cameraapparatuses that are obtained as a result of the search, and each IPaddress, in a memory (not illustrated) within the voice processing unit305 (S31B). Picture processing unit 307 displays a picture that isimage-captured by each of searched camera apparatuses C11 to C1 n, onthe screen of display 318. FIG. 29 is a diagram illustrating the screenof display 318 that is displayed at the time of the preset processing.Thumbnails SZ1 to SZ4 for the pictures that are image-captured by cameraapparatuses C11 to C1 n, respectively, are displayed to the left side ofthe screen of display 318 in such a manner that thumbnails SZ1 to SZ4are selectable. Particularly, in a case where there is no need todistinguish between thumbnails SZ1 to SZ4, thumbnails SZ1 to SZ4 arereferred to as thumbnail SZ for short. Furthermore, thumbnail SZ isdisplayed by taking a still image out of the picture that isimage-captured by each of camera apparatuses C11 to C1 n, every fixedtime. Furthermore, voice processing unit 305 displays voice map 265 onthe right side of the center of the screen of display 318.

Voice processing unit 305 sets variable i indicating a number of thecamera apparatus to initial value 0 (S31C). Then, voice processing unit305 increases variable i by 1 (S31D). Voice processing unit 305 receivesthumbnail SZ that is selected by the user through operation unit 317(S31E). When it comes to the selection of the thumbnail, the user movescursor 323 that is displayed on the screen of display 318, to selectthumbnail SZ. In FIG. 29, thumbnail SZ3 is selected. A frame of selectedthumbnail SZ3 is displayed in a highlighting manner. Instead ofthumbnail SZ, as is the case with the fifth embodiment, sound sourcemark SD may be selected.

The user installs the sound source in the image capture range that isimage-captured in one of camera apparatuses C11 to C1 n, whichcorresponds to thumbnail SZ, and causes the sound source to generatesound in a prescribed sound volume or higher for a prescribed time(S32). A position of the sound source may not be on the optical axis ofeach of camera apparatuses C11 to C1 n, and may be in the image capturerange. Furthermore, the user, as the sound source, may generate voicestanding within the image capture range, as is the case with the fifthembodiment.

When microphone array apparatus MA sound-picks up voice in a prescribedsound volume or higher, which is generated from the sound source andtransmits voice data on the voice to voice processing unit 305, voiceprocessing unit 305 receives the voice data that is transmitted frommicrophone array apparatus MA (S33).

Voice processing unit 305 obtains the pointing direction (the horizontalangle θ and the vertical angle φ) from microphone array apparatus MAtoward the sound source, based on the sound volume of the received voicedata, and displays sound source SD indicating the sound generationsource position, on voice map 265 that is displayed on display 318(S34). Sound source mark SD3 indicating a new sound source position isdrawn in the form of a rectangle on display 318 (refer to FIG. 29).Sound source marks SD1 and SD2 that are “Register R1,” and “RegisterR2,” respectively, are already confirmed, and because of this, are drawnin the form of a circle. Additionally, voice processing unit 305displays input box 329 for a camera name (for example, a place name) ona lower right corner of the screen of display 318 to urge inputting.

The user selects thumbnail SZ or source mark SD, and inputs the camerainformation into input box 329 for the camera name (535A). Instead ofthe user selecting thumbnail SZ or sound source mark SD, in the casewhere the sound volume of voice that is sound-picked up by themicrophone array apparatus is a prescribed threshold or above for aprescribed time, the voice processing apparatus may automaticallyrecognize a direction in which the voice occurs, may cause the soundsource mark to be displayed on the display, and may urge the user toinput the camera information.

Voice processing unit 305 reads the horizontal angle θ and the verticalangle φ of sound source mark SD (S36), and transmits the camerainformation (the camera name and the IP address) that is input in StepS35A, and the pointing direction (horizontal angle θ and the verticalangle φ) of microphone array apparatus MA, to microphone array apparatusMA (S37). Microphone array apparatus MA lists the preset information,which is transmitted from voice processing unit 305, in presetinformation table 290 for storing in storage unit 224.

Voice processing unit 305 determines whether or not variable i reachesthe total number n of camera apparatuses that are searched (S38A). In acase where variable i does not reach the total number n of cameraapparatuses, voice processing unit 305 returns Step S31D and repeats thesame processing. On the other hand, in a case where variable i reachesthe total number n of camera apparatuses, the preset processing iscompleted, and proceeding to the processing at the time of the running(the monitoring) takes place.

At the time of the monitoring, voice processing unit 305 acquires allpreset positions P1 to Pn from microphone array apparatus MA, anddisplays acquired preset positions P1 to Pn on voice map 265 that isdisplayed on display 318 (S39A). Picture processing unit 307 reads apicture that is image-captured by each of camera apparatuses C11 to C1n, and displays the picture, which is read, on the screen of display 318(539B). FIG. 30 is a diagram illustrating the screen of display 318 thatis displayed at the time of the monitoring, and a sound generationoperation by speaker 319. At this point, a case is described where eightcamera apparatuses C11 to C18 are installed. Images GZ1 to GZ8 that areimage-captured in camera apparatuses C11 to C18, respectively, aredisplayed on the left side of the screen of display 318. At this point,images GZ1 to GZ8 are not thumbnails, and are images that results fromcamera apparatuses C11 to C18 image-capturing “Register R1,” “RegisterR2,” “Register R3,” “Entrance, “Magazine Shelf T2,” “Magazine Shelf T1,”“Passage,” and “Staff Entrance.”

Voice map 265 and operation panel 340 are displayed on the right side ofthe screen of display 318. Sound source marks SD1 to SD8 are displayedon voice map 265. Furthermore, luminance button 341 for adjustingbrightness of each of images GZ1 to GZ8, focus button 342 for adjustinga focus of the picture that is image-captured in each of cameraapparatuses C11 to C18, selection button 343 for selecting any one ofcamera apparatuses C11 to C18, sound volume button 345 for adjusting thesound volume, and preset button 346 for switching a state of the soundpickup for the directionality to a state of the sound pickup of allpieces of sound are provided on operation panel 340.

In a case where voice is output from speaker 319, voice processing unit305 receives sound source mark SD or image GZ that is designated by theuser (S40A). The user makes a selection by clicking sound source mark SDon voice map 265 that is displayed on the screen of display 318, using acurser 323, or makes a selection by clicking images GZ1 to GZ8 that aredisplayed on the screen of display 318, using cursor 323. At this point,image GZ5 or sound source mark SD5 is selected, a frame of image GZ5 isdisplayed in a red-color highlighting manner, and sound source mark SD5is surrounded by a rectangle that assumes a red color as a backgroundcolor. Speaker 319 outputs voice data with “Magazine Shelf T2” being setto be the pointing direction.

Furthermore, when the user designates other one position that isdisplayed on display 258 while reproducing the voice that issound-picked up at the time of the running (more precisely, actualmonitoring) YES in S43), voice processing apparatus 250 reads thehorizontal angle and the vertical angle from the preset information onthe designated position (S41), and forms the directionality of the voicedata in the pointing direction and outputs the voice from speaker 259(S42).

On the other hand, if there is no designation of a new designationposition (NO in S43), voice processing apparatus 250 continues toperform the reproduction until a power source is turned OFF (S44). Theuser can newly add, change, and delete contents of the preset table byclicking preset button 346, and may monitor all pieces of sound untilthe next position is designated.

As described above, monitoring system 205A according to the sixthembodiment preset processing that lists the pointing direction of thevoice which corresponds to the preset position, while actually viewingthe picture that is image-picked up in each of camera apparatuses C11 toC1 n, and thus the work efficiency at the time of the preset processingis improved. For example, in a case where the sound source is placed atthe preset information (as well as in a case where the speaker stands inthe preset position), it is simply understood that what is necessary isto place the sound source at the center of the picture that isimage-captured in the camera apparatus. Furthermore, at the time of themonitoring, in a case where the pointing direction of microphone arrayapparatus MA is switched, the user can decide a switching destination byviewing the picture that is manage-captured in each of cameraapparatuses C11 to C1 n.

Seventh Embodiment

In the fifth and sixth embodiments, the preset information is stored inthe microphone array apparatus, but in a seventh embodiment, a casewhere is described where a plurality of microphone arrays are installedand the monitoring apparatus manages the preset information in a unifiedmanner.

FIG. 31 is a block diagram illustrating a configuration of monitoringsystem 205B according to the seventh embodiment. The monitoring systemaccording to the seventh embodiment has a configuration that is almostthe same as that according to the fifth embodiment. Constituent elementsthat are the same as those in the fifth embodiment are given the samereference numerals, and descriptions thereof are omitted.

A plurality of microphone array apparatuses MA1 to MAm are connected tonetwork 215. A plurality of microphone array apparatuses MA1 to MAm aredifferent from those according to the fifth to sixth embodiment, anddoes not have a storage unit in which the present information is stored.Each of microphone array apparatuses MA1 to MAm, for example, may employthe same configuration as any one of microphone arrays 5, 5A, 5B, and 5Caccording to the first to fourth embodiments.

Monitoring apparatus 300A has table memory 310 in which presetinformation table 330 in which the preset information is listed isstored. FIG. 32 is a diagram illustrating a detail of listing of presetinformation table 330 that is stored in table memory 310.

A place, a preset value, and a camera IP address are listed in presetinformation table 330. Furthermore, as the preset value, a number (MicNo=) of the microphone array apparatus, a pointing direction, and adirectionality control parameter are listed. The directionality controlparameter is a directionality filter coefficient, and is decided byperforming learning control in each of the pointing directions.

As the preset information, for example, place: Register R1, Mic No.=MA1,pointing direction: (θ11, φ11), directionality control parameter (p311,. . . , p11q), camera IP address: “165.254.10.11” are listed.Furthermore, in Magazine Shelf, pieces of preset information areredundantly listed by microphone array apparatus MA1 and microphonearray apparatus MA2. That is, as the preset information, place: MagazineShelf, Mic No.=MA1, pointing direction: (θ13, φ13), directionalitycontrol parameter (p131, . . . p13q), and camera IP address:“165.254.10.13”, and place: Magazine Shelf, Mic No.=MA2, pointingdirection: (θ23, φ23), directionality control parameter (p231, . . . ,p23q), and camera IP address: “165.254.10.13” are both listed. In a casewhere pieces of preset information are listed redundantly for the samepreset position, preset information on a voice in a high sound volume ispreferentially used among pieces of voice that are sound-picked up intwo microphone array apparatuses MA1 and MA2, and voice that issound-picked up in microphone array apparatus MA that corresponds to thepreset information is output from speaker 319.

Operation of monitoring system 205B that has the configuration describedabove is described. FIG. 33 is a flowchart illustrating a presetprocessing procedure. The same step processing as in the fifth and sixthembodiments is given the same step number, and a description thereof isomitted. Furthermore, a case where there are two microphone arrayapparatuses MA1 and MA2 is described.

Voice processing unit 305 within monitoring apparatus 300A sets an IPaddress for each of microphone array apparatuses MA1 and MA2, and seteach of microphone array apparatuses MA1 and MA2 to be in a state wherecommunication is possible (Step S31Z). Additionally, voice processingunit 305 enters the preset mode, and displays two voice maps 265A and265B on display 318. Picture processing unit 307 performs the broadcaston all camera apparatuses C11 to C1 n that are connected to network 215,and receives responses to the broadcast, and thus searches cameraapparatuses C11 to C1 n that are connected to network 215 (S31A).

Voice processing unit 305 stores the total number n of cameraapparatuses that are obtained as a result of the search, and each IPaddress, in the memory (not illustrated) within the voice processingunit 305 (S31B). Picture processing unit 307 displays a picture that isimage-captured by each of searched camera apparatuses C11 to C1 n, onthe screen of display 318. FIG. 34 is a diagram illustrating the screenof display 318 that is displayed at the time of the preset processing.Thumbnails SZ1 to SZ4 for the pictures that are image-captured by cameraapparatuses C11 to C1 n, respectively, are displayed to the left side ofthe screen of display 318 in such a manner that thumbnails SZ1 to SZ4are selectable. Particularly, in a case where there is no need todistinguish between thumbnails SZ1 to SZ4, thumbnails SZ1 to SZ4 arecollectively referred to as thumbnail SZ for short. Furthermore,thumbnail SZ is displayed by taking a still image out of the picturethat is image-captured by each of camera apparatuses C11 to C1 n, everyfixed time. Furthermore, voice processing unit 305 displays two voicemaps 265A and 265B on the right side of the center of the screen ofdisplay 318.

In Step S32, the user installs the sound source in the image capturerange that is image-captured in one of camera apparatuses C11 to C1 n,which corresponds to thumbnail SZ, and, when sound is caused to begenerated in a prescribed sound volume or higher for a prescribed time,each of microphone array apparatuses MA1 to MAm sound-picks up voice inthe prescribed sound volume or higher, which is generated from the soundsource, and transmits each piece of voice data to voice processing unit305. Voice processing unit 305 receives the voice data that istransmitted from each of microphone array apparatuses MA1 and MA2(S33A).

Voice processing unit 305 obtains the pointing direction (the horizontalangle θ and the vertical angle φ) from each of microphone arrayapparatuses MA1 and MA2 toward the sound source, based on the soundvolume of the voice data that is received from each of microphone arrayapparatuses MA1 and MA2, and displays sound source map SD (SD3A andSD3B) indicating the sound generation source position on voice maps 265Aand 265B that are displayed on display 318 (534A). Sound source mapsSD3A and SD3B indicating new sound generation source position are drawnon display 318 (refer to FIG. 34). Sound source marks SD1 and SD2 thatare “Register R1,” and “Register R2,” respectively, are alreadyconfirmed, and because of this, are drawn in the form of a circle.Additionally, voice processing unit 305 displays input box 329 for acamera name (for example, a place name) on a lower right corner of thescreen of display 318 to urge inputting. At this point, voice that issound-picked up in microphone array apparatus MA2 has a high soundvolume, a rectangle of sound source map SD3B that corresponds to thisvoice is larger in size than sound source map SD3A that corresponds tomicrophone array apparatus MA1.

Voice processing unit 305 selects any one of the plurality of microphonearray apparatuses (microphone array apparatuses MA1 and MA2 here), whichis a target for the preset processing (S34B). In a case where voice in aprescribed sound volume or higher is sound-picked up in the plurality ofmicrophone array apparatuses, the selection of microphone arrayapparatuses MA1 and MA2 is made using any one of the following threemethods. In a first method, voice processing unit 305 selects amicrophone array apparatus that sound-picks up sound in a high soundvolume, from between microphone array apparatuses MA1 and MA2. In asecond method, the user selects one of microphone array apparatuses MA1and MA2. In a third method, the sound volume is compared with athreshold, and a microphone array apparatus that picks up a sound in asound volume that is at a threshold or higher. In this case, in somecases, a plurality of microphone array apparatuses are selected.

The user selects thumbnail SZ or sound source mark SD for the selectedmicrophone array apparatus MA and inputs the camera information intoinput box 329 for the camera name (S35B). Processing in subsequent StepsS36 to S38A is the same as in the sixth embodiment. In Step S38A, whenvariable i reaches the total number n of camera apparatuses in StepS38A, voice processing unit 305 ends the present operation.

FIG. 35 is a flowchart illustrating the sound pickup procedure at thetime of the monitoring. Picture processing unit 307 within monitoringapparatus 300A selects any one of camera apparatuses C11 to C1 n, anddisplays the picture that is image-captured in the selected cameraapparatuses C11 to C1 n, to display 318 (S51).

FIG. 36 is a diagram illustrating the screen of display 318 that isdisplayed at the time of the monitoring, and the sound generationoperation by speaker 319. Pull down menu 360 for various items isdisplayed on the left side of the screen of display 318. At this point,the pull down menu in an apparatus hierarchical tree is spread, and astate where camera apparatus C12 is selected appears. Monitor screen 350on which the picture that is image-captured in selected camera apparatusC12 is displayed is positioned on the upper portion of approximately thecenter of the screen of display 318. Operation panel 340A is positionedon the lower portion of approximately the center of the screen ofdisplay 318. Provided on operation panel 340A are luminance button 341for adjusting brightness of the picture, focus button 342A for adjustinga focus of the picture that is image-captured in each of cameraapparatuses C11 to C18, selection button 343 for selecting any of cameraapparatuses C11 to C18, and zoom button 347 for performing a zoomingoperation. A preset input box in which a new preset position is input ina case where the new preset position is added may be provided onoperation panel 340A.

Voice processing unit 305 reads the preset information that correspondsto the selected camera information (S52). Voice processing unit 305forms the directionality of the voice data in the pointing direction(the horizontal angle θ and the vertical angle φ) that is obtained fromthe preset information (S53). Voice processing unit 305 determineswhether or not a plurality of microphone array apparatuses MA go throughthe preset processing (S54). In a case where there are a plurality ofmicrophone array apparatuses MA, voice processing unit 305, for example,selects microphone array apparatus MA that sound-picks up sound in thehighest sound volume which is decided at the time of the presetprocessing (S55).

Voice processing unit 305 outputs the voice data of which thedirectionality is formed in the selected microphone array apparatus MA,from speaker 319 (S56). In FIG. 36, voice “Welcome” is output fromspeaker 319. Thereafter, voice processing unit 305 returns to Step S51,and repeats the same operation.

As described above, in monitoring system 205B according to the seventhembodiment, because a plurality of microphone array apparatuses MA areincluded, sound can be picked up using the microphone array apparatusthat is in a position where the user easily hears the voice that isgenerated within the store. Furthermore, in a case where it is possiblethat the plurality of microphone array apparatuses sound-pick up voicein a prescribed sound volume or higher, the preset processing isperformed on the microphone array apparatus that picks up sound in thehighest sound volume, and thus it is possible that even sound in a lowsound volume is heard without any missing. Additionally, even in a casewhere one microphone array apparatus malfunctions, voice in the samepreset position can be heard using a different microphone arrayapparatus.

Furthermore, because monitoring apparatus 300A manages the presetinformation in a unified manner, each microphone array apparatus may nothave a storage unit in which the preset information is stored, and thusa configuration can be simplified. Furthermore, voice processing unit305 does not need to transmit the preset information to each microphonearray apparatus MA, and a processing load can be reduced, leading to areduction in traffic on a network.

First Modification Example

FIG. 37 is a diagram a screen of display 318A that is displayed at thetime of the monitoring in a first modification example of the seventhembodiment. A monitor screen that is divided into nine portions arearranged on portions except for the lower portion of the screen ofdisplay 318A. Image GZ1A to GZ8A and GZ9 that are image-captured incamera apparatuses C11 to C19, respectively, are displayed in a somewhatenlarged manner. Furthermore, pull down menu 360A is positioned on thelower left side of the screen. Furthermore, operation panel 340B ispositioned on the lower right side of the screen. Operation panel 340Bis the same as in FIG. 30 in the sixth embodiment. Unlike in FIG. 30 inthe sixth embodiment, in the first modification example, the voice mapis not displayed.

In a monitoring apparatus in the first modification example, when theuser selects a place from which voice comes, from a plurality ofpictures that are displayed on the screen of display 318A, speaker 319outputs the voice that comes from the image-captured place. Furthermore,when voice processing unit 305 receives the voice data that results fromoccurrence of sound in a high sound volume, picture processing unit 307changes a color of a frame of a picture of a place where the sound in ahigh sound volume occurs, and thus alerts the user to the place wherethe sound occurs. Switching of voice is performed manually orautomatically.

Second Modification Example

In a second modification example of the seventh embodiment, each of theplurality of microphone array apparatuses MA1 to MAn has a storage unit,the monitoring apparatus integrates pieces of preset information thatare received from the plurality of microphone array apparatuses MA1 toMAn, respectively, and creates one preset information table. Themonitoring apparatus stores the created preset information table in atable memory. Furthermore, the integrated pieces of preset information,which are stored as the preset information table in the table memory, istransmitted to each of microphone array apparatuses MA1 to MAn.

Accordingly, even in a case where new microphone array apparatus isconnected, the monitoring apparatus does not need to perform the presetprocessing, and only acquires the preset information from the microphonearray apparatus for integration. Thus, the preset information table inwhich new preset information is listed can be obtained. Furthermore,even in a case where other one monitoring apparatus is added, the presetinformation is transmitted from the microphone array apparatus to otherone monitoring apparatus and the other one monitoring apparatusintegrates pieces of preset information that are transmitted from theplurality of microphone array apparatuses. Thus, the preset informationtable can be obtained. In this manner, a monitoring system in which theplurality of microphone array apparatuses and a plurality of monitoringapparatus are combined can be simply set up.

Third Modification Example

In a third modification example of the seventh embodiment, among theplurality of camera apparatuses C11 to C1 n, one camera apparatus is aPTZ camera that has a pan tiling function of being remotely controllablefrom the monitoring apparatus, a zoom-on function and a zoom-outfunction (which are hereinafter referred to a PTZ function). The PTZcamera sets a place that is specified in advance, to be a preset value,and stores a pan tilting angle and a zoom value in a memory.

In a case where a plurality of preset positions are set to be in the PTZcamera, if among the plurality of camera apparatus C11 to C1 n, fixationcamera other than the PTZ camera is included, the number of times thatpresetting of a timer is performed is the number N of preset thatresults from considering a sum of the number of fixation camera and thenumber of times of presetting of the PTZ camera, not the total number nof cameras.

FIG. 38 is a table illustrating a detail of listing in a presetinformation table 330A in the third modification example of the seventhembodiment. A place, a microphone preset value, a camera IP address, anda camera preset value are listed in preset information table 330A. Thelisting of the microphone preset value and the camera IP address is thesame as in the case of preset information table 330 that is illustrateddin FIG. 32. When it comes to a new camera preset value, in the case ofthe fixation camera, because the image capture position is not changed,the camera preset value is “Null.” On the other hand, in the case of thePTZ camera, the image capture position (in other words, the pointingposition from the microphone array apparatus) that is viewed from thePTZ camera changes with Magazine Shelf T2 and Passage U1, the camerapreset value is “PT1,” and “PT2.”

In a case where a target area (a place) that is image-captured by thePTZ camera, such as Magazine Shelf T2 or Passage U1, is selected, themonitoring apparatus reads the voice data in the pointing direction ofthe microphone array apparatus and, at the same time, transmits thecamera preset value to the PTZ camera. The PTZ camera image-captures apicture in the image capture direction that corresponds to the presetvalue. The image capture area that is the monitoring target can beeasily changed by using the PTZ camera. In the third modificationexample, instead of the fixation camera, the PTZ camera is used, but anomnidirectional camera may be used.

The various embodiments are described above with reference to thedrawings, but it goes without saying that the present disclosure is notlimited to such examples. It is apparent to a person of ordinary skillthat various modification examples or alteration examples can becontemplated within the scope of claims, and it is understood that thesealso justifiably fall into the technical scope of the presentdisclosure.

For example, in the embodiments described above, both end portions(plugs) of the LAN cable that connects to the microphone array and thecamera are inserted the microphone LAN connector with which themicrophone array is equipped and the camera LAN connector with which thecamera is equipped, respectively, and thus installation is performed ina freely attachable and detachable manner. However, the installation maybe performed as follows. That is, the microphone array may not beequipped with the microphone LAN connector. The LAN cable of which oneend may be directly attached within the microphone array is pulled outof the case of the microphone, and the other end may be formed as a plugin such a manner that the other end is inserted into the camera LANconnector. Accordingly, work for installation of the connector on themicrophone side is unnecessary, and the efficiency work is improved.

INDUSTRIAL APPLICABILITY

According to the present disclosure, as a microphone array and amonitoring system that are useful, there are provided a microphonearray, which is connected to a network along with a camera, and in whichan amount of work is reduced when a monitoring system is set up andparticularly when the monitoring system is set up in a state of beingcombined with the camera, and a monitoring system that includes themicrophone array. Additionally, according to the present disclosure, asa monitoring system and a sound pickup setting method, there areprovided a monitoring system and a sound pickup setting method, inwhich, even in a case where a positional relationship between a cameraand a microphone array is unclear, the directionality is suitably formedin a predetermined image capture position, voice in the predeterminedimage capture position is clearly output, and thus an amount of work isreduced when the monitoring system is set up in a state of beingcombined with the camera.

REFERENCE MARKS IN THE DRAWINGS

-   -   3, 3A, 3B, 3C CAMERA    -   3 i DRIVE UNIT    -   3 j POLE    -   3 u CAMERA CASE    -   3 x, 5 x OUTLET    -   3 y, 3 z, 5 y, 5 z, 5 w CASE    -   4 SPACER    -   5, 5A, 5B, 5C MICROPHONE ARRAY    -   5 r, 7 h, 7 u OPENING    -   6 PoE APPARATUS    -   7, 7A, 7B CEILING ATTACHMENT METAL FITTING    -   7C ATTACHMENT METAL FITTING    -   7 k HOLE    -   7 m PROTRUSION MEMBER    -   7 n PLATE    -   7 v PIN HOLE    -   7 p SUPPORT MEMBER    -   7 p 1 BOLT    -   7 p 5 PRESSING PLATE    -   7 p 51 UPPER PLATE    -   7 p 52 LOWER PLATE    -   7 p 53 PRESSURE PLATE    -   7 p 6 RECEPTION RING    -   7 q VERTICAL MEMBER    -   7 q 1, 7 q 2 PROTRUSION MEMBER    -   7 y 1, 7 z LOCKING PIECE    -   9 NETWORK    -   10, 10A, 10B, 10C MONITORING SYSTEM    -   16 SPACE    -   18 CEILING    -   18 z CEILING PLATE    -   19 CEILING BUILDING FRAME    -   19 z FIXATION BOLT    -   19 y NUT    -   31, 31 g CAMERA LAN NETWORK    -   31 z, 38 z, 39 z, 61 z, 62 z, 63 z, 91 z, 92 z INNER WIRING    -   32 CAMERA DC CONNECTOR    -   33, 54, 54A, 79Z FIXATION PIN    -   34, 57 INPUT SWITCH UNIT    -   34 x, 34 y, 57 x, 5′7 y, 58 z INPUT TERMINAL    -   34 z, 57 z, 58 x, 58 y OUTPUT TERMINAL    -   35, 55 HOST CPU    -   36, 56 NETWORK INTERFACE UNIT    -   37, 59 STEP-UP/DOWN POWER SUPPLY CIRCUIT    -   38, 61 PoE ELECTRIC POWER RECEPTION CIRCUIT    -   39, 62 SYSTEM ELECTRIC POWER SUPPLY CIRCUIT    -   40 IMAGING UNIT    -   44, 44A, 44B CEILING FIXATION PIN    -   51, 51A, 52, 52A, 51C, 52C MICROPHONE LAN CONNECTOR    -   53, 53A, 53B MICROPHONE DC CONNECTOR    -   58 OUTPUT SWITCH UNIT    -   63 PoE POWER TRANSMISSION UNIT    -   64 MICROPHONE SET    -   65 MICROPHONE UNIT    -   71, 71 z, 72, 73, 73 z ENGAGEMENT HOLE    -   79 COVER    -   91, 92, 92A, 92 e LAN CABLE    -   93, 94 DC CABLE    -   177 PERIPHERAL WALL ELASTIC PAWL    -   179 CUT    -   205, 205A, 205B MONITORING SYSTEM    -   210 STORE    -   215 NETWORK    -   224 STORAGE UNIT    -   225 CODING UNIT    -   226 TRANSMISSION UNIT    -   230 PC    -   240 RECORDER APPARATUS    -   250 VOICE PROCESSING APPARATUS    -   251 SIGNAL PROCESSING UNIT    -   252 SOUND SOURCE DIRECTION DETECTION UNIT    -   253 DIRECTIONALITY FORMATION UNIT    -   254 INPUT AND OUTPUT CONTROL UNIT    -   255 MEMORY    -   256 TRANSMISSION UNIT    -   257, 317 OPERATION UNIT    -   258, 318 DISPLAY    -   259, 319 SPEAKER    -   265, 265A, 265B VOICE MAP    -   265 h, 265 i, 265 j CONCENTRIC CIRCLE    -   265 m LINE SEGMENT    -   267 CAMERA INFORMATION    -   270 PICTURE PROCESSING APPARATUS    -   271 CAMERA MONITOR    -   281 SPEAKER    -   287, 323 CURSOR    -   288 INPUT BOX    -   290, 330, 330A PRESET INFORMATION TABLE    -   300, 300A MONITORING APPARATUS    -   305 VOICE PROCESSING UNIT    -   307 PICTURE PROCESSING UNIT    -   310 TABLE MEMORY    -   329 INPUT BOX    -   340, 340A, 340B OPERATION PANEL    -   341 LUMINANCE BUTTON    -   342, 342A FOCUS BUTTON    -   343 SELECTION BUTTON    -   345 SOUND VOLUME BUTTON    -   346 PRESET BUTTON    -   346A PRESET INPUT BOX    -   347 ZOOM BUTTON    -   350 MONITOR SCREEN    -   360, 360A PULL DOWN MENU    -   A1 TO An A/D CONVERTER    -   C11 TO C1 n CAMERA APPARATUS    -   CR1 TO CRn IMAGE CAPTURE RANGE    -   FLR FLOOR    -   GZ1 TO GZn, GZ1A TO GZnAIMAGE    -   M1 TO Mn MICROPHONE    -   MA, MA1 TO MAn MICROPHONE ARRAY APPARATUS    -   O CENTRAL POINT    -   P1 TO Pn PRESET POSITION    -   PA1 TO PAn AMPLIFIER (AMP)    -   PKT PACKET    -   RF CEILING    -   SD, SD1 TO SDn, SD3A, SD3B SOUND SOURCE MAP    -   SZ1 TO SZ4 THUMBNAIL

What is claimed is:
 1. A microphone array is included in a monitoringsystem and is connected to a camera and a network, comprising: a soundpickup device that has a plurality of sound pickup elements, andsound-picks up voice; a network interface that has a hub function oftransmitting voice data on the voice that is sound-picked up by thesound pickup device, and image data that is image-captured by the cameraand is transmitted to the microphone array, to the network through afirst signal line; an electric power source that receives first electricpower that is transmitted through the first signal line that is linkedto the network, or second electric power that is transmitted from anouter electric power source; an input switch that switches between thefirst electric power or the second electric power and outputs theresulting electric power; and an output switch that supplies an outputof the input switch to the camera through a second signal line, whichlinks between the microphone array and the camera, or through anelectric power supply line.
 2. The microphone array of claim 1, whereinthe electric power source receives the first electric power that istransmitted through the first signal line, wherein the input switchoutputs the first electric power, and wherein the output switch suppliesan output of the input switch to the camera through the second signalline.
 3. The microphone array of claim 1, wherein the electric powersource receives the first electric power that is transmitted through thefirst signal line, wherein the input switch outputs the first electricpower, and wherein the output switch supplies an output of the inputswitch to the camera through the electric power supply line.
 4. Themicrophone array of claim 1, wherein the electric power source receivesthe second electric power that is transmitted from the outer electricpower source, wherein the input switch outputs the second electricpower, and wherein the output switch supplies an output of the inputswitch to the camera through the second signal line.
 5. The microphonearray of claim 1, wherein the electric power source receives the secondelectric power that is transmitted from the outer electric power source,wherein the input switch outputs the second electric power, and whereinthe output switch supplies an output of the input switch to the camerathrough the electric power supply line.
 6. The microphone array of claim1, wherein an opening that produces a space that is able to accommodatethe second signal line is formed in a rear surface of a case of themicrophone array.
 7. The microphone array of claim 6, wherein an outletthrough which the second signal line is able to be pulled out of thespace is formed a side surface of the case of the microphone array. 8.The microphone array of claim 1, wherein a connector to which the secondsignal line is able to be connected is provided on a rear surface of acase of the microphone array.
 9. The microphone array of claim 1,wherein a connector to which the second signal line is able to beconnected is provided on a side surface of a case of the microphonearray.
 10. The microphone array of claim 1, wherein the networkinterface has a switching function.
 11. The microphone array of claim 1,wherein the network interface has a repeater function.
 12. A monitoringsystem includes a camera that is connected to a network and a microphonearray that is connected to the network, the microphone array comprising;a sound pickup device that has a plurality of sound pickup elements, andsound-picks up voice; a network interface that has a hub function oftransmitting voice data on the voice that is sound-picked up by thesound pickup device, and image data that is image-captured by the cameraand is transmitted to the microphone array, to the network through afirst signal line; an electric power source that receives first electricpower that is transmitted through the first signal line that is linkedto the network, or second electric power that is transmitted from anouter electric power source; an input switch that switches between thefirst electric power or the second electric power and outputs theresulting electric power; and an output switch that supplies an outputof the input switch to the camera through a second signal line thatlinks between the microphone array and the camera, or through anelectric power supply line.